ADENO-ASSOCIATED VIRUS VECTORS 



Cross-Reference to Related Applications 
This application is a continuation under 37 C.F.R. 1.53(b) of U.S. 
application Serial No. 09/276,625, filed March 25, 1999, which is a continuation-in- 
part application claiming priority of invention under 35 U.S.C § 1 19(e) from U.S. 
application Serial No. 60/086,166 filed May 20, 1998, the disclosures of which 
applications are incorporated by reference herein. 

Statement of Government Rights 
This invention was made with a grant from the Govermnent of the United 
States of America (Grant No. DK/HL58340 from the National Mstitutes of Health). 
The Government may have certain rights in the invention. 

Background of the Invention 

Adeno-associated virus (AAV) is a non-pathogenic parvovirus with a 
single-stranded DNA genome of 4680 nucleotides. The genome may be of either 
plus or minus polarity, and codes for two groups of genes, Rep and Cap (Bems et 
aL, 1990). Mverted terminal repeats (ITRs), characterized by palindromic sequences 
producing a high degree of secondary sixucture, are present at botii ends of the viral 
genome. While other members of the parvovirus group replicate autonomously, 
AAV requires co-infection with a helper virus (i.e., adenovirus or herpes virus) for 
lytic phase productive replication, hi the absence of a helper virus, wild-type AAV 
(wtAAV) estabUshes a latent, non-productive infection with long-term persistence 
by integrating into a specific locus on chromosome 19, AAVSl, of the host genome 
through a Rep-facilitated mechanism (Samulski, 1993; Linden et al., 1996; Kotin et 
al., 1992). 

In contrast to wtAAV, the mechanism(s) of latent phase persistence of 
recombinant AAV (rAAV) is less clear. rAAV integration into the host genome is 
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not site-specific due to deletion of the AAV Rep gene (Ponnazhagan et al., 1997). 
Analysis of integrated proviral structures of both wild type and recombinant AAV 
have demonstrated head-to-tail genomes as the predominant structural forms, 

rAAV has recently been recognized as an extremely attractive vehicle for 
gene delivery (Muzyczka, 1992). rAAV vectors have been developed by 
substituting all viral open reading frames with a therapeutic minigene, while 
retaining the cis elements contained in two inverted terminal repeats (ITRs) 
(Samulski et al, 1987; Samulski et al., 1989). Following transduction, rAAV 
genomes can persist as episomes (Flotte et al., 1994; Afione et al., 1996; Duan et al., 
1998), or altematively can integrate randomly into the cellular genome (Bems et al., 
1996; McLaughUn et al, 1988; Duan et al., 1997; Fisher-Adams et al., 1996; Keams 
et al, 1996; Ponnazhagan et al, 1997). However, little is known about the 
mechanisms enabling rAAV vectors to persist in vivo or the identity of cellular 
factors which may modulate the efficiency of transduction and persistence. 
Although transduction of rAAV has been demonstrated in vitro in cell culture 
(Muzyczka, 1992) and in vivo in various organs (Kaplitt et al., 1994; Walsh et al., 
1994; Conrad et al., 1996; Herzog et al., 1997; Snyder et al., 1997), the mechanisms 
of rAAV-mediated transduction remain unclear. 

Moreover, while rAAV has been shown to be capable of stable, long-term 
transgene expression both in vitro and in vivo in a variety of tissues, the transduction 
efficiency of rAAV is markedly variable in different cell types. For example, rAAV 
has been reported to transduce lung epitheUal cells at low levels (Halbert et al., 
1997; Duan et al., 1998a), while high level, persistent transgene expression has been 
demonstrated in muscle, neurons and in other non-dividing cells (Kessler et al., 
1996; Fisher et al., 1997; Herzog et al., 1997; Xiao et al., 1996; Kaplitt et al, 1994; 
Wu et al, 1998; Ali et al., 1996; Bennett et al., 1997 Westfall et al., 1997). These 
tissue-specific differences in rAAV mediated gene transfer may, in part, be due to 
variable levels of cellular factors affecting AAV infectivity (i.e., receptors and co- 
receptors such as heparin sulfate proteoglycan, FGFR-1, and aVpS integrin) 



(Summerford et al, 1998; Qing et aL, 1999; Summerford et al., 1999) as well as the 
latent life cycle (i.e., nuclear trafficking of virus and/or the conversion of single 
stranded genomes to expressible forms) (Qing et al, 1997; Qing et al., 1998). 

Muscle-mediated gene trmsfer represents a very promising approach for the 
treatment of hereditary myopathies and several other metabolic disorders. Previous 
studies have demonstrated remarkably efficient and persistent transgene expression 
to skeletal muscle in vivo with rAAV vectors. Applications in this model system 
include the treatment of several inherited disorders such as Factor IX deficiency in 
hemophilia B and epo deficiencies (Kessler et al., 1996; Herzog et al., 1997). 
Although the conversion of low-molecular- weight rAAV genomes to high- 
molecular-weight concatamers has been inferred as evidence for integration of 
proviral DNA in the host genome, no direct evidence exists in this regard (Xiao et 
al., 1996; Clark et al., 1997; Fisher et al. 1997). Also, the molecular processes 
and/or structures associated with episomal long-term persistence of rAAV genomes, 
e.g., in nondividing mature myofibers, remains unclear. 

Thus, there is a need for rAAV vectors that have increased stability and/or 
persistence in host cells. Moreover, there is a need for vectors usefiil to express 
large open reading fi-ames. 

Summary of the Invention 

The present invention provides a recombinant adeno-associated virus 
(rAAV) vector comprising a nucleic acid segment formed by the juxtaposition of 
sequences in the AAV inverted terminal repeats (ITRs) which are present in a 
circular intermediate of AAV. The circular intermediate was isolated fi-om rAAV- 
infected cells by employing a recombinant AAV "shuttle" vector. The shuttle vector 
comprises: a) a bacterial origin of repUcation; b) a marker gene or a selectable gene; 
c) a 5 ' ITR; and d) a 3 ' ITR. Preferably, the recombinant AAV shuttle vector 
contains a reporter gene, e.g., a GFP, alkaline phosphatase or p-galactosidase gene, 
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a selectable marker gene, e.g., an ampicillin-resistance gene, a bacterial origin of 
replication, a 5' ITR and a 3' ITR. The vector is contacted with eukaryotic cells so 
as to yield transformed eukaryotic cells. Low molecular weight DNA ("Hirt DNA") 
from the transformed eukaryotic cells is isolated. Bacterial cells are contacted with 
5 the Hirt DNA so as to yield transformed bacterial cells. Then bacterial cells are 
identified which express the marker or selectable gene present in the shuttle vector 
and which comprise at least a portion of a circular intermediate of adeno-associated 
virus. Also, as described below, it was found that circularized interaiediates of 
rAAV impart episomal persistence to linked sequences in Hela cells, fibroblasts and 
^10 muscle cells, hi HeLa cells, the incorporation of certain AAV sequences, e.g., ITRs, 
Q from circular intermediates into a heterologous plasmid conferred a 10-fold increase 

in the stability of plasmid-based vectors in HeLa cells. Unique features of these 
transduction intermediates included the in vivo circularization of a head-to-tail 
fifi monomer as well as nmltimer (concatamers) episomal viral genomes with associated 

15 specific base pair alterations in the 5' viral D-sequence. The majority of circular 
intermediates had a consistent head-to-tail configuration consisting of monomer 
genomes (<3 kb) which slowly converted to large multimers of >12 kb by 80 days 
post-infection in muscle. Importantly, long-term transgene expression was 
associated with prolonged (80 day) episomal persistence of these circular 
20 intermediates. Thus, in vivo persistence of rAAV can occur through episomal 
circularized genomes which may represent prointegration intermediates with 
increased episomal stability. Moreover, as described below, co-infection with 
adenovirus, at high multiphcities of infection (MOI) capable of producing early 
adenoviral gene products, led to increases in the abundance and stability of AAV 
25 circular intermediates which correlated with an elevation in transgene expression 
from rAAV vectors. Thus, these results demonstrate the existence of a molecular 
structure involved in AAV transduction which may play a role in episomal 
persistence and/or integration. 
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Further, these results may aid in the development of non-viral or viral-based 
gene delivery systems having increased efficiency. For example, therapeutic or 
prophylactic therapies in which the present vectors are useful include blood 
disorders (e.g., sickle cell anemia, thalassemias, hemophilias, and Fanconi anemias), 
5 neurological disorders, such as Alzheimer's disease and Parkinson's disease, and 
muscle disorders involving skeletal, cardiac or smooth muscle, hi particular, 
therapeutic genes useful in the vectors of the invention include the P-globin gene, 
the Y-globin gene, the cystic fibrosis transmembrane conductance receptor gene 
(CFTR), the Fanconi anemia complementation group, a gene encoding a ribozyme, 
l'^ 10 an antisense gene, a low density lipoprotein (LDL) gene, a tyrosine hydroxylase 

fi gene (Parkinson's disease), a glucocerebrosidase gene (Gaucher's disease), an 

arylsulfatase A gene (metachromatic leukodystrophies) or genes encoding other 
polypeptides or proteins. Also within the scope of the invention is the inclusion of 
more than one gene in a vector of the invention, i.e., a plurality of genes may be 
1 5 present in an individual vector. Further, as a circular intermediate may be a 
concatamer, each monomer of that concatamer may comprise a different gene. 

For viral-based delivery systems, helper-free virus can be prepared (see WO 
95/13365) from circular intermediates or vectors of the invention. Altematively, 
liposomes, plasmid or virosomes may be employed to deliver a vector of the 
20 invention to a host or host cell. 

The increased persistence of circular intermediates or vectors having one or a 
plurality of ITRs may be due to the primary and/or secondary structure of the ITRs. 
The primary structure of a consensus sequence (SEQ ID N0:3) of ITRs formed by 
the juxtaposition and physical (phosphodiester bond) linkage of ITRs from AAV is 
25 shown in Figure 2C. However, as described hereinbelow, each ITR sequence may 
be incomplete, i.e., the ITR may be a subunit or portion of the full length ITRs 
present in the consensus sequence. Moreover, preferably, an isolated DNA segment 
of the invention is not the 165 bp double DD sequence (SEQ ID N0:7) disclosed in 
U.S. Patent No. 5,478,745, referred to as a "double sequence". 
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Moreover, the formation, persistence and/or abundance of molecules having 
the ITR sequences of the invention may be modulated by helper virus, e.g., 
adenoviral proteins and/or host cell proteins. Thus, the circular intermediates or 
vectors of the invention may be useful to identify and/or isolate proteins that bind to 
5 the ITR sequences present in those molecules. 

Therefore, the present invention provides an isolated and purified DNA 
molecule comprising at least one DNA segment, a biologically active subunit or 
variant thereof, of a circular intermediate of adeno-associated virus, which DNA 
segment confers increased episomal stability, persistence or abundance of the 

10 isolated DNA molecule in a host cell. Preferably, the DNA molecule comprises at 
least a portion of a left (5 ') inverted terminal repeat (ITR) of adeno-associated virus. 
Also preferably, the DNA molecule comprises at least a portion of a right (3 ')- 
inverted terminal repeat of adeno-associated vims. The invention also provides a 
gene transfer vector, comprising: at least one first DNA segment, a biologically 

1 5 active subunit or variant thereof, of a circular intermediate of adeno-associated 

virus, which DNA segment confers increased episomal stability or persistence of the 
vector in a host cell; and a second DNA segment comprising a gene. Preferably, the 
second DNA segment encodes a therapeutically effective polypeptide. The first 
DNA segment comprises ITR sequences, preferably at least about 100, more 

20 preferably at least about 300, and even more preferably at least about 400, bp of 
adeno-associated virus sequence. A preferred vector of the invention is a plasmid. 

Thus, the vector of the invention is usefiil in a method of delivering and/or 
expressing a gene in a host cell, to prepare host cells having the vector(s), and in the 
preparation of compositions comprising such vectors. To deliver the gene to the 

25 host cell, a recombinant adenovirus helper virus may be employed. 

As described hereinbelow, the tibiahs muscle of mice was co-infected with 
rAAV Alkaline phosphatase (Alkphos) and GFP encoding vectors. The GFP shuttle 
vector also encoded ampicillin resistance and a bacterial origin of repUcation to 
allow for bacterial rescue of circular intermediates in Hirt DNA fi-om infected 
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muscle samples. There was a time dependent increase in the abundance of rescued 
plasmids encoding both GFP and Alkphos that reached 33% of the total circular 
intermediates by 120 days post-infection. Furthermore, these large circular 
concatamers were capable of expressing both GFP and Alkphos encoded transgenes 
following transient transfection in cell lines. Thus, concatamerization of AAV 
genomes in vivo occurs through intermolecular recombination of independent 
monomer circular viral genomes. Therefore, a plurahty of DNA segments, each in 
an individual rAAV vector, may be delivered so as to result in a single DNA 
molecule having a plurality of the DNA segments. For example, one rAAV vector 
comprises a first DNA segment comprising a 5' ITR linked to a second DNA 
segment comprising a promoter operably linked to a third DNA segment comprising 
a first open reading frame linked to a fourth DNA segment comprising a 3' ITR. A 
second rAAV vector comprises a first DNA segment comprising a 5' ITR Imked to 
a second DNA segment comprising a promoter operably linked to a third DNA 
segment comprising a second open reading firame linked to a fourth DNA segment 
comprising a 3' ITR. 

In another embodiment, one rAAV vector comprises a first DNA segment 
comprising a 5' ITR linked to a second DNA segment comprising a promoter 
operably hnked to a third DNA segment comprising the 5' end of an open reading 
fi-ame linked to fourth DNA segment comprising a 5 ' splice site linked to a fifth 
DNA segment comprising a 3' ITR. The second rAAV vector comprises a first 
DNA segment comprising a 5' ITR linked to a second DNA segment comprising a 3 ' 
splice site hnked to a third DNA segment comprising the 3 ' end of the open reading 
frame linked to a fourth DNA segment comprising a 3* ITR. Preferably, the second 
and third DNA segments together comprise DNA encoding, for example, CTFR, 
factor Vin, dystrophin, or erythropoietin. Also preferably, the second DNA segment 
comprises the endogenous promoter of the respective gene, e.g., the epo promoter. 

Thus, the invention provides a composition comprising: a first adeno- 
associated virus vector comprising linked DNA segments and at least a second 



adeno-associated viras comprising linked DNA segments. The linked DNA 
segments of the first vector comprise: a first DNA segment comprising a 5' ITR; a 
second DNA segment comprising at least a portion of an open reading frame 
operably linked to a promoter, wherein the DNA segment does not comprise the 
entire open reading frame; a third DNA segment comprising a splice donor site; and 
iv) a fourth DNA segment comprising a 3' ITR. The linked DNA segments of the 
second vector comprise a first DNA segment comprising a 5' ITR; a second DNA 
segment comprising a splice acceptor site; a third DNA segment comprising at least 
a portion of an open reading frame which together with the second DNA segment of 
the fnst vector encodes a full-length polypeptide; and a fourth DNA segment 
comprising a 3' ITR. Preferably, the second DNA segment of the first vector 
comprises a first exon of a gene comprising more than one exon and the third DNA 
segment of the second vector comprises at least one exon of a gene that is not the 
first exon. 

The invention also provides a method to transfer and express a polypeptide 
in a host cell. The method comprises contacting the host cell with at least two 
rAAV vectors. One rAAV vector comprises a first DNA segment comprising a 5 
ITR linked to a second DNA segment comprising a promoter operably linked to a 
third DNA segment comprising a first open reading frame linked to a fourth DNA 
segment comprising a 3' ITR. A second rAAV vector comprises a first DNA 
segment comprising a 5* ITR linked to a second DNA segment comprising a 
promoter operably linked to a third DNA segment comprising a second open reading 
frame linked to a fourth DNA segment comprising a 3 ITR. Altematively, one 
rAAV vector comprises a first DNA segment comprising a STTR linked to a second 
DNA segment comprising a promoter operably linked to a third DNA segment 
comprising the 5' end of an open reading frame linked to foxirth DNA segment 
comprising a 5 ' splice site linked to a fifth DNA segment comprising a 3* ITR. The 
second rAAV vector comprises a fu-st DNA segment comprising a 5' ITR linked to 
a second DNA segment comprising a 3 ' splice site linked to a third DNA segment 
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comprising the 3 ' end of the open reading frame hnked to a fourth DNA segment 
comprising a 3 ITR. The host cell is preferably contacted with both of the vectors, 
concurrently, although it is envisioned that the host cell may be contacted with each 
vector at a different time relative to the contact with the other vector(s). 

Also provided is a method in which the composition of the invention is 
administered to the cells or tissues of an animal For example, rAAV vectors have 
shown promise in transferring the CFTR gene into airway epitheUal cells of animal 
models and nasal sinus of CF patients. However, high level expression of CFTR has 
not been achieved due to the fact lhat AAV cannot accommodate the full-length 
CFTR gene together with a potent promoter. A number of studies have tried to 
optimize rAAV-mediated CFTR expression by utilizing truncated or partially 
deleted CFTR genes together with stronger promoters. However, it is currently 
unknown what effect deletions within the CFTR gene may have on 
complementation of bacterial colonization defects in the CF airway. Therefore, the 
present invention includes the administration to an animal of a composition of the 
invention comprising at least two rAAV vectors which together encode CTFR. The 
present invention is useful to overcome the current size limitation for transgenes 
within rAAV vectors, and allows for the incorporation of a larger transciptional 
regulatory region, e.g., a stronger heterologous promoter or the endogenous CFTR 
promoter. 

Brief Description of the Figures 
Figure 1 . Structure of proviral shuttle vector and the predicted structure of 
rAAV circular intermediate monomers. With the aid of a rAAV c/^-acting plasmid, 
pCisAV.GFP3ori (Panel A), AV.GFP3ori recombinant virus was produced 
(Panel B). This vector encoded a GFP transgene cassette, an ampicillin resistance 
gene (amp), and a bacterial repHcation origin (ori). The predominant form of 
circular intermediates isolated following transduction of Hela cells with 
AV.GFP3ori consisted of head-to-tail monomers (Panels C and D). 



Figure 2. Structural analysis of rAAV circular intermediates in Hela cells. 
Circular rAAV intermediate clones isolated from AV.GFPSori infected Hela cells 
were analyzed by diagnostic restriction digestion with Asel, SphI, and PstI together 
with Southern blotting agamst ITR, GFP, and Stuffer ^^P-labeled probes. In panel 
A, four clones representing the diversity of intermediates found (pl90, p333, p280, 
and p345) gave a diagnostic PstI (P) restriction pattern (3kb and 1 .7kb bands) 
consistent with a circular monomer or multimer intact genome [agarose gel (Left) 
and Southern blot (Right)]. SphI (S) digestion demonstrated existence of a single 
ITR (pi 90), two URs in a head-to-tail orientation (p333 and p280), and three ITJts 
(p345) in isolated circular intermediates. The restriction pattern of pCisAV.GFP3ori 
(U; uncut, P; PstI cut, S; SphI cut) and 1 kb DNA ladder (L) are also given for 
comparison. One additional circular form (p340) was repetitively seen and had an 
unidentifiable structure which lacked intact ITR sequences. Circular concatamers 
were identified by partial digestion with Asel for clones p280 (dimer) and p333 
(monomer) as is shown in Panel B. Sequence analysis (Panel C) of six clones with 
identical restriction patterns to p333 (Panel A) was performed usmg primers 
(indicated by arrows) juxtaposed to the partial p5 promoter (dotted line) and ITRs 
(sohd hne). The top sequence represents the proposed head-to-tail structure of intact 
ITR arrays with ahgmnent of sequence derived from individual clones. The junction 
of the inverted ITRs is marked by mverted arrowheads (at 251bp). Several 
consistent bp changes (shaded) were noted in the 5 'ITR D-sequence (boxed) within 
four clones (p79, p81, p87, and p88). All bp changes are indicated in lower case 
letters. 

Figure 3. Adenovirus augments AAV circular intermediate formation in 
Hela cells. Infection of Hela cells with increasing doses (0, 500, and 5000 
particles/cell) of recombinant El-deleted adenovirus (Ad.CMVlacZ) leads to 
substantial expression of E2a 72kd DNA Binding Protein, as demonstrated by 
indirect inmiunofluorescent staining for DBP at 72 hours post-infection (Panel A). 
Co-infection of Hela cells with AdCMVlacZ (5000 particles/cell) and AV.GFP3ori 
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(1000 DNA particles/cell) led to substantial augmentation of rAAV GFP transgene 
expression (Panel B). Augmentation in rAAV GFP transgene expression in the 
presence of increasing amounts (0, 500, 5000 and 10000 particles/cell) of 
recombinant Ad.CMVlacZ was quantified by FACS analysis at 72 hour 
post-infection (Panel C). Results demonstrate the mean (+/-SEM) for two 
experiments performed in duplicate. In addition, an aliquot of cells was split (1:10) 
at the time of FACS analysis and GFP colony forming units (CFU) per lOX field 
were quantified at 6 days (CPE denotes significant cytopathic effects at an 
adenoviral MOI of 10,000 particles/cell and was not quantified for GFP colonies). 
Hirt DNAs from AV.GFP3ori (1000 DNA particles/cell) infected Hela cells with or 
without co-infection with Ad.CMVlacZ (5,000 p^icles/cell) were used to transform 
£. colL The total number of ampicillin-resistant bacterial CFU (Panel D) and total 
number of head-to-tail circular intermediates CFU (Panel E) are given for a 
representative experiment. Greater than 20 clones for each time point were 
evaluated by Southem blot (see Figure 2 for detail). Zero hour controls were 
performed by mixing an equivalent amount of AV.GFPSori virus as used in 
experiments with mock infected cellular lysates prior to Hirt purification. Panel F 
depicts the abxmdance of head-to-tail circular intermediates as a percentage of total 
ampicillin-resistant bacterial CFU isolated from Hirt DNA. 

Figure 4. Formation of rAAV head-to-tail circular intermediates following 
in vivo transduction of muscle. The tibiahs anterior muscle of 4-5 week old 
C57BL/6 mice were infected with AV.GFPSori (3 X 1010 particles) m HEPES 
buffered saline (30 ^\). GFP expression (Panel A) was analyzed by direct 
immunofluorescence of freshly excised tissues and/or in formalin-fixed 
cryopreserved tissue sections in four independently injected muscles harvested at 0, 
5, 10, 16, 22 and 80 days post-infection. GFP expression was detected at low levels 
beginning at 10 days and was maximum at 22 days post-infection. Expression 
remained stable to 80 days at which time greater than 50% of the tissue was positive 
(see 80 day tissue cross section counter stained with propidium iodide, panel A). 
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Hirt DNA was isolated from muscle samples at each of the various time points and 
after points was used to transform E. coli. Rescued plasmids (p439, pl6, pl7) were 
analyzed by Southern blotting in Panel B showing an agarose gel on left and ITR 
probed blot on right. U:uncut, P:PstI cut, and S:SphI cut. The schematic drawing of 
the most predominant type of head-to-tail circular AAV intermediate plasmids 
rescued from bacteria is given in the right of Panel B and shows the structure of pi 7 
as an example. Other typical clones included those with less than two ITRs as 
shown for pl6. SphI digestion of pl6 and pl7 plasmids released ITR hybridizing 
fragments of approximately 140 and 300 bp, respectively. The slightly lower 
mobility then predicted for these ITR fragments likely represents anomalous 
migration due to the high secondary structure of inverted repeats within ITRs. 
Sequence analysis of pl7 and pl6 using nested primers to 5' and 3 '-ITRs also 
confirmed the ITR orientations shown to the right of the gel. Additional restriction 
enzyme analyses to determine this structure included double and single digests with 
SphI, PstI, Asel, and/or Smal. An example of an atypical clone (p439) rescued 
from bacteria with unknown structure is also shown. 

Figure 5. Frequency of circular intermediate formation in muscle following 
transduction with rAAV. Hirt DNAs isolated from rAAV infected tibiahs muscle 
were used to transform E. coli and the rescued plasmids analyzed by Southem 
blotting (greater than 20 clones were analyzed from at least two independent muscle 
samples for each time point). The averages of total head-to-tail circular intermediate 
clones (line) and ampicilUn resistant bacterial clones (bar) isolated from each tibialis 
anterior muscle at 0, 5, 10, 16, 22 and 80 days post-infection are summarized in 
Panel A. Only plasmids which contained 1-2 ITRs were included in the estimation 
of total head-to-tail circular intermediates. Plasmids which demonstrated an 
absence of ITR hybridizing SphI fragments (between 150 to 300 bp) were omitted 
from the calculations. Panel B demonstrates the diversity of ITR arrays found in 
head-to-tail circular intermediates at 80 days post-infection. This panel depicts a 
Southem blot probed with ITR sequences and represents circular intermediates with 
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1-3 ITRs. SphI fragments which hybridize to ITR probes indicate the size of 
inverted ITR arrays (marked by arrows to right of gel). Additional restriction 
enzyme analysis was used to determine the structure of monomer and multimer 
circular intermediates. Examples are shown for two multimer (pi 36 and pl43) 
circular intermediates which contain approximately three AAV genomes. 
Undigested plasmids of pl36 and pl43 migrate greater than 12 kb and is contrasted 
to the most predominant form of head-to-tail undigested circular intermediates at 22 
days which migrate at 2.5 kb. The digestion pattern of pi 36 is consistent with a 
uniform head-to-tail configuration of three genomes which is indistinguishable from 
digestion patterns of pl39 which contains one circularized genome (undigested pl39 
migrates at 2.5 kb, data not shown, also see examples pl7 in Figure 4). In contrast, 
pi 36 depicts a more complex head-to-tail multimer circular intermediate which has 
various deletions and duplications within the ITR arrays. Predicted structure of five 
representative intermediates is schematically shown in Panel C, 

Figure 6. Molecular size of circular intermediates in muscle. HirtDNA 
from AV.GFP3ori infected muscle was size fractionated by electrophoresis and 
various molecular weight fractions transformed into E. colL Results demonstrate 
the abundance of circular intermediates at each of the given molecular weights at 22 
and 80 days post-infection with the rAAV shuttle vector. Structure of circular 
intermediates were confirmed by Southem blot restriction analysis. 

Figure 7. Head-to-tail circular intermediates demonstrate increased stability 
of GFP expression following transient transfection in Hela cells. Subconfluent 
monolayers of Hela cells were co-transfected with p81, p87, or pCMVGFP and 
pRS VlacZ as an internal control for transfection efficiency as described in the 
methods. Panel A demonstrates the expansion of GFP clones after one passage 
(arrows). Quantification of clone size and numbers are shown in Panel B. Clone 
size represents the mean raw values while clone numbers are normalized for 
transfection efficiency as determined by X-gal staining for pRS VlacZ. The data at 
the top of bar graph values for each construct in Panel B represents quantification of 
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GFP clones after second passage (also normalized for transfection efficiency). 
Results indicate the mean (+/-SEM) of duplicate experiments with greater than 20 
fields quantified for each experimental point. The persistence of transfected p81 and 
pCMVGFP plasmid DNA at passage-7 post-transfection was evaluated by genomic 
Southem blot of total cellular DNA hybridized against ^^P-labeled GFP probe (Panel 
C, results from two independent transfections are shown). U:uncut, C:PstI cut. The 
migration of uncut dimer and monomer plasmids forms are marked on the left. PstI 
digestion of the plasmids results in bands at 4.7 kb (pCMVGFP, single PstI site in 
plasmid) and 1.7 kb (p81, two PstI sites flanking the GFP gene). To determine 
whether the head-to-tail ITR array within circular intermediates was responsible for 
increases in the persistence of GFP expression, the head-to-tail ITR DNA element 
was subcloned into the pGL3 luciferase plasmid to generate pGL3(ITR). Results in . 
Panel D compare the extent of luciferase transgene expression following 
transfection with pGL3 and pGL3(ITR) at 10 days (passage-2) post-transfection. 
Results are the mean (+/-SEM) for tripUcate experiments and are normalized for 
transfection efficiency using a dual renilla luciferase reporter vector (pRLS V40, 
Promega). 

Figure 8. Identification of adenoviral genes responsible for augmentation of 
AAV circular intermediate formation. Hela cells were infected with AV.GFP3ori 
(1000 DNA particles/cell) in the presence of wtAdS, dlS02 (E2a-deleted), and 
i/1 1004 (E4-deleted) adenovirus (at the indicated MOIs). Total number of head-to- 
tail circular intermediates fi-om Hirt DNA and the level of augmentation of GFP 
transgene expression (as determined by FACS) was quantified at 24 hours post- 
infection (Panel A). Results are the average of duplicate experiments. Panel B 
depicts results firom Southem blot analysis of Hirt DNA following hybridization to a 
GFP P^^-labeled probe. DNA loads were 10% of the total Hirt yield from a 35 mm 
plate of Hela cells. Infections were carried out identically to that described for 
Panel A. Arrows mark replication form concatamers (RfJ, dimers (RQ, monomers 
(RQ, and single-stranded AAV genomes (ssDNA), 
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Figure 9. Model for independent mechanistic interactions of adenovirus 
with lytic and latent phase aspects of the AAV Ufe cycle. The adenoviral E4 gene 
has been shown to augment the level of rAAV second strand synthesis giving rise to 
replication form dimers (Rfa) and monomers (RQ (Figure 8B). This augmentation 
leads to substantial increases in transgene expression from rAAV vectors and most 
closely mirrors lytic phase replication of wtAAV as head-to-head and tail-to-tail 
concatamers. hi contrast, E4 expression inhibits the formation of head-to-tail 
circular intermediates of AAV. Hence, it appears that increases m the amount of Rf^ 
and R4 double stranded DNA genomes does not increase the extent of circular 
intermediate formation. Such findings suggest that conversion of Rf^ and Rf^ to 
circular intermediates does not likely occur and implicates two mechanistically 
distinct pathway for their formation. Li support of this hypothesis, adenoviral E2a 
gene expression does not enhance the formation of RJ^ and Rf^ genomes but rather 
increase the abundance and/or stability of head-to-tail circular intermediates. 
Furthermore, in the absence of E4, E2a gene expression does not lead to 
augmentation of rAAV transgene expression. Since circular intermediates have 
increased episomal stability in muscle and in Hela cells, this molecular structure 
may be important in the latent phase of AAV persistence. Altematively, these 
circular intermediates may represent pre-integration complexes as previously 
hypothesized for Rep facilitated integration, hi the absence of Rep, circular 
intermediates may accumulate episomally in rAAV infected cells, hi summary, 
these findings support the notion that adenovirus may modulate both latent and lytic 
aspects of the AAV hfe cycle. 

Figure 10. hidividual chemical sequence of SphI fragments from p81 (A; 
SEQ ID N0:4), p79 (B; SEQ ID N0:5), and pl202 (C; SEQ ID N0:6) AAV circular 
intermediates. The ends of the sequence (underlined) represent SphI restriction 
enzyme sites within head-to-tail circular AAV genomes cloned with the AV- 
GFPSori shuttle virus. 
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Figure 1 1 . Chemical sequence homology of three AAV circular 
intermediates with various conformations of ITR arrays. Diversity in ITR arrays are 
evident from the non-conserved bases marked in lower case. The ends of the 
sequence (underlined) represent SphI restriction enzyme sites within head-to-tail 
circular AAV genomes cloned with the AV.GFPSori shuttle virus. 

Figure 12A. Palindromic repeat structure derived from chemical sequencing 
of AAV circular intermediate isolate p81. Secondary structure of the sense strand is 
depicted in the top box with plasmid reference given below. 

Figure 12B. Palindromic repeat structure derived from chemical sequencing 
of AAV circular intermediate isolate p79. Secondary structure of the sense strand is 
depicted in the top box with plasmid reference given below. 

Figure 12C. Palindromic repeat structure derived from chemical sequencing 
of AAV circular intermediate isolate p79. Secondary structure of the sense strand is 
depicted in the top box with plasmid reference given below. 

Figure 13. Persistence of GFP expression in developing Xenopw^' embryos 
microinjected with AAV circular intermediate isolate p8L The extent of GFP 
fluorescence in tadpoles reflects the stability of episomal or integrated microinjected 
plasmids. Bright field image on the left is of the p81 injected embryo. The p81 
injected embryo depicts fluorescence in nearly all cells by one week post-injection. 
Li contrast, a mosaic pattern of expression in a minority of cells in pCisAV.GFPori 
injected embryos. The pCisAV.GFPori plasmid contains the identical promoter 
sequences driving GFP gene expression and two ITRs separated by stuffer sequence. 
These findings demonstrate that specific structural characteristics found within AAV 
circular intermediates are responsible for increased persistence of transgene 
expression. 

Figure 14. Mechanistic scheme for determining pathways for rAAV circular 
concatamer formation. The two independent vectors used in these studies, 
AV.Alkphos and AV.GFP3.ori, are shown in Panel A . Restriction sites important 
in the structural analysis of circular intermediates are also shown. In Panel B, a 
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schematic representation of two potential models for circular concatamer formation 
is depicted, along with the methods to experimentally differentiate which of these 
processes is active in muscle. Following co-infection of the tibiahs muscle with 
AVAlkphos and AV.GFP3.ori, all subsequently rescued plasmids arise solely from 
circular intermediates containing AV.GFP3ori genomes. If rolling circular 
repKcation is the sole mechanism of concatamerization, only GFP expressing 
plasmids should be rescued. In contrast, if intemiolecular recombination between 
independently formed monomer circular intermediates is the mechanism of 
concatamerization, both GFP and GFP/Alkphos expressing plasmids should be 
rescued. 

Figure 15. Co-infection of tibiaUs muscle of mice with AV.Alkphos and 
AV.GFPSori. Transgene expression of rAAV infected tibialis muscle was 
determined at 14, 35, 80 (Panels A and AO, and 120 (Panels B-D) days following 
co-infection with 5 x 10^ DNA particles each of AV.Alkphos and AV.GFP3ori. The 
time course of transgene expression started around 14 days and peaked by 35-80 
days. The extent of co-infection of myofibers with both Alkphos and GFP rAAV 
was deteimined in serial sections of 80 and 120 day post-infection muscle samples. 
Panels A-C represent GFP fluorescence of formalin fixed, cryoprotected sections, 
while panels K'-C depict the histochemical staining for Alkaline phosphatase in 
adjacent serial sections. A short stauiing time (7 minutes) was necessary to observe 
v^ation in staining levels for comparison to GFP. It was found that longer staining 
times (30 minutes) saturated the Allq)hos signal. The boxed region in panels B and 
B' are enlarged in panels C and C, respectively. A more precise correlation of GFP 
and Alkphos staining in myofibers is given in Panel D in which co-localization of 
GFP and Alkphos expression was examined in the same section of a 120 day post- 
infected sample. This was performed by photographing the GFP fluorescent image 
prior to staining for Alkphos activity. The left panel of D shows a high power 
Nomarski photomicrograph of a group of myofibers (traced in red) , while the 
corresponding GFP and Alkphos staining pattems are shown in the right panel. 
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Photomicrographs of Alkphos staining were taken with a red filter to allow for 
superimposition of staining patterns with GFP fluorescence. Co-expression of 
Alkphos and GFP is shown within myofibers as a yellow/orange color Myofibers 
are marked as follows: (-) negative for both Allq)hos and GFP, (*) positive for only 
GFP, and (+) positive for both GFP and Alkphos, 

Figure 16. Rescue of circular intermediates and characterization of DNA 
hybridization pattems. Using the ampicillin resistance gene (amp) and bacterial ori 
incorporated into the AV.GFPSori vector, the extent of circular intermediate 
formation was assessed by rescuing amp resistant plasmids following transformation 
of 1/5 the isolated Hirt DNA into E. coli Sure cells. Twenty plasmids from each 
muscle sample were prepared and analyzed by slot blot hybridization against GFP, 
Alkphos, and Amp ^^P-labeled DNA probes. A representative group demonstrating 
the hybridization pattems is shown in Panel A. Panel B depicts the mean (+/-SEM) 
number of rescued bacterial plasmids that hybridized to either GPF alone, or to both 
GFP and Alkphos probes, following transformation of 1/5* of the Hirt DNA. These 
numbers were calculated from the percentage of plasmids hybridizing to GPF and/or 
Alkphos and the total CFU plating efficiency derived from the original 
transformation. In total, 3 independent muscle samples were analyzed for a total of 
60 plasmids at each time point. The percentage of GFP hybridization positive 
rescued plasmids that also demonstrated hybridization to Alkphos is shown in 
Panel C. These data demonstrate an increase in the abundance of rescued 
GFP/Alkphos co-encoding circular intermediates over time. 

Figure 17. Transgene expression from rescued circular intermediates. 
Rescued circular intermediate plasmids were transfected into 293 cells for 
assessment of their ability to express encoded transgenes. In these studies all GFP 
hybridization positive clones from at least two muscles were tested for each time 
point and scored for their ability to express GFP and AlkaUne phosphatase. In total 
at least 40 clones were evaluated for each time point. Three pattems of liansgene 
expression were observed following transfection of these plasmids: I) no gene 
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expression (Panel A), II) GFP expression only (Panel B), and m) GFP and Alkphos 
expression (Panel C). Panels A-C depict Nomarski photomicrographs (left) of GFP 
fluorescent fields (center) and Alkphos staining of a different field firom the same 
culture (right). The percentage of GFP hybridization positive clones that also 
5 expressed GFP is shown in Panel D. Additionally, this panel illustrates the 
percentage of GFP expressing clones also expressing Alkphos. 

Figure 18, Structural analysis of bi-functional concatamer circular 
intermediates. To fully characterize the nature of GFP and Alkphos co-expressing 
circular intermediates, detailed structural analyses were performed using restriction 
10 enzyme mapping and Southem blot hybridization with GFP, Alkphos, and ITR ^^P- 
labeled probes. Results fi^om Southem blot analysis of plasmid clone #33 (Panel A) 
and clone #5 (Panel C) are given as representative examples of circular 
intermediates isolated fi^om 80 and 35 day Hirt DNA of rAAV infected muscle, 
respectively. Agarose gels were run in triplicate for each of these clones and 
1 5 Southem blot filters were hybridized with one of the three DNA probes as indicated 
ly below each autoradiogram. Molecular weights (kb) are indicated to the left of the 

ethidium stained agarose gel and restriction enzymes are marked on the top of each 
gel/filter. Panels B and D give the deduced structure of plasmid clones #33 and #5, 
respectively, as based on Southem blot analysis. For ease of comparison with the 
20 restriction maps of the viral genomes given in Figure 14A, the position of restriction 
enzyme sites (kb) are marked with the indicated orientation of intact viral genomes. 
However, in clone #33 a deletion occurred between the Asel and Hindm site of a 
head-to-tail array between AV.Alkphos and AV.GFP3ori, as reflected by a 900 bp 
reduction in the anticipated size of HindlU/Notl and Clal/Asel fragments (marked by 
25 asterisks in Panel A). Furthermore, the SphI site flanking an ITR was ablated in 
clone #5 (bands effected by this deletion are marked by asterisks in Panel C). The 
deletion is not reflected in the overall concatamer since the exact region involved 
and/or the size of the deletion is unclear. Additionally, chemical sequence evidence 
of rescued circular intermediates suggests that the predominant form of ITR arrays 
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may be in a double-D structure (ie., one ITR flanked by two D-sequence rather than 
two ITRs) and hence ITR arrays containing fragments may appear 147 bp shorter 
than indicated. However, to more easily depict the orientation of viral genomes, the 
position of 5 'and 3' ITRs is indicated rather than representing a single ITR at these 
5 junctions. 

Figure 19. AppUcation of rAAV circular concatamers to deliver trans- 
spUcing vectors with large gene inserts. Panel A depicts two rAAV vectors 
encoding two halves of a cDNA (red) and flanked by splice site consensus 
sequences (brown). Panel B depicts one potential type of intermolecular concatamer 
10 following co-infection of cells with the independent vectors shown in panel A. Full 
length transgene mRNA can then be produced by splicing between these two vector 
encoded sequences within circular concatamers. 



Detailed Description of the Invention 

m 15 Definitions 

As used herein, the terms "isolated and/or purified" refer to in vitro 
preparation, isolation and/or purification of a nucleic acid molecule of the invention, 
so that it is not associated with in vivo substances. 

As used herein, a DNA molecule, sequence or segment of the invention 
20 preferably is biologically active. A biologically active DNA molecule of the 
invention has at least about 1%, more preferably at least about 10%, and more 
preferably at least about 50%, of the activity of a DNA molecule comprising ITR 
sequences from a circular intermediate of AAV, e.g., a DNA molecule comprising 
SEQ ID N0:3, SEQ ED N0:4, SEQ ID N0:5, SEQ ID N0:6, or a subunit or variant 
25 thereof The activity of a nucleic acid molecxile of the invention can be measured by 
methods well known to the art, some of which are described hereinbelow. For 
example, the presence of the DNA molecule in a recombinant nucleic acid molecule 
in a host cell results in episomal persistence and/or increased abundance of the 
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recombinant molecule in those cells relative to corresponding cells having a 
recombinant nucleic acid molecule lacking a DNA molecule of the invention. 

A variant DNA molecule, sequence or segment of the invention has at least 
about 70%, preferably at least about 80%, and more preferably at least about 90%, 

5 but less than 100%, contiguous nucleotide sequence homology or identity to a DNA 
molecule comprising ITR sequences from a circular intermediate of AAV, e.g., SEQ 
ID N0:3, SEQ ID N0:4, SEQ ID N0:5, SEQ ID N0:6, a subunit thereof A variant 
DNA molecule of the invention may include nucleotide bases not present in SEQ ID 
N0:3, SEQ ID N0:4, SEQ ID N0:5, SEQ ID N0:6, e.g., 5', 3' or intemal deletions 

10 or insertions, such as the insertion of a restriction endonuclease recognition site, so 
long as these bases do not substantially reduce the biological activity of the 
molecule. A substantial reduction in activity means a reduction in activity of greater 
than about 50%, preferably greater than about 90%. 

15 I. Identification of Nucleic Acid Molecules Falling Within th e Scone of the 
Invention 

A. Nucleic Acid Molecules of the Invention 

L Sources of the Nucleic Acid Molecules of the Invention 

Sources of nucleotide sequences from which the present nucleic acid 

20 molecules can be obtained include AAV infected cells, e.g., any vertebrate, 
preferably mammalian, cellular source. 

As used herem, the terms "isolated and/or purified" refer to in vitro isolation 
of a nucleic acid, e.g., DNA molecule from its natural cellular environment, and 
from association with other components of the cell, such as nucleic acid or 

25 polypeptide, so that it can be sequenced, replicated, and/or expressed. For example, 
"isolated nucleic acid" is RNA or DNA containing greater than about 50, preferably 
about 300, and more preferably about 500 or more, sequential nucleotide bases that 
comprise a DNA segment from a circular intermediate of AAV which contains at 
least a portion of the 5 ' and 3 ' ITRs and the D sequence, or a variant thereof, that is 
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complementary or hybridizes, respectively, to AAV ITR DNA and remains stably 
bound under stringent conditions, as defined by methods well known in the art, e.g., 
in Sambrook et al., 1989. Thus, the RNA or DNA is "isolated" in that it is free from 
at least one contaminating nucleic acid with which it is normally associated in the 
natural source of the RNA or DNA and is preferably substantially free of any other 
mammalian RNA or DNA. The phrase "free from at least one contaminating source 
nucleic acid with which it is normally associated" includes the case where the 
nucleic acid is reintroduced into the source or natural cell but is in a different 
chromosomal location or is otherwise flanked by nucleic acid sequences not 
normally found in the source cell, e.g., in a vector or plasmid. An example of 
isolated nucleic acid within the scope of the invention is nucleic acid that shares at 
least about 80%, preferably at least about 90%, and more preferably at least about 
95%, sequence identity with SEQ ID N0:3, SEQ ID N0:4, SEQ ID N0:5 or SEQ 
ID N0:6, or a subunit thereof 

As used herein, the term "recombinant nucleic acid" or '^preselected nucleic 
acid," e.g., "recombinant DNA sequence or segment" or "preselected DNA 
sequence or segment" refers to a nucleic acid, e.g., to DNA, that has been derived or 
isolated from any appropriate cellular source, that may be subsequently chemically 
altered in vitro, so that its sequence is not naturally occurring, or corresponds to 
naturally occurring sequences that are not positioned as they would be positioned in 
a genome which has not been fransformed with exogenous DNA. An example of 
preselected DNA "derived" from a source, would be a DNA sequence that is 
identified as a useful fragment within a given organism, and which is then 
chemically synthesized in essentially pure form. An example of such DNA 
"isolated" from a source would be a useftil DNA sequence that is excised or 
removed from said source by chemical means, e.g., by the use of restriction 
endonucleases, so that it can be fiirther manipulated, e.g., amplified, for use in the 
invention, by the methodology of genetic engineering. 
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Thus, recovery or isolation of a given fragnient of DNA from a restriction 
digest can employ separation of the digest on polyacrylamide or agarose gel by 
electrophoresis, identification of the fragment of interest by comparison of its 
mobility versus that of marker DNA fragments of known molecular weight, removal 
of the gel section containing the desired fragment, and separation of the gel from 
DNA. See Lawn et al.. Nucleic Acids Res.. 9, 6103 (1981), and Goeddel et al., 
Nucleic Acids Res.. 8, 4057 (1980). Therefore, "preselected DNA" includes 
completely synthetic DNA sequences, semi-synthetic DNA sequences, DNA 
sequences isolated from biological sources, aad DNA sequences derived from RNA, 
as well as mixtures thereof 

Nucleic acid molecules having base pair substitutions (i.e., variants) are 
prepared by a variety of methods known in the art. These methods include, but are 
not limited to, isolation from a natural source (in the case of naturally occurring 
sequence variants) or preparation by ohgonucleotide-mediated (or site-directed) 
mutagenesis, PGR mutagenesis, and cassette mutagenesis of an earher prepared 
variant or a non-variant version of the nucleic acid molecule. 

Ohgonucleotide-mediated mutagenesis is a preferred method for preparing 
substitution variants. This technique is well known in the art as described by 
Adehnan et al., DNA. 2, 183 (1983). Briefly, AAV DNA is altered by hybridizing 
an oUgonucleotide encodmg the desired mutation to a DNA template, where the 
template is the single-stranded form of aplasmid or bacteriophage containing the 
unaltered or native DNA sequence of AAV. After hybridization, a DNA polymerase 
is used to synthesize an entire second complementary strand of the template that will 
thus incorporate the ohgonucleotide primer, and will code for the selected alteration 
in the AAV DNA. 

Generally, oUgonucleotides of at least 25 nucleotides in length are used. An 
optimal oUgonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 
mutation. This ensures that the oligonucleotide will hybridize properly to the single- 



23 



stranded DNA template molecule. The oligonucleotides are readily synthesized 
using techniques known in the art such as that described by Crea et al., Proc. Natl. 
Acad. Sci. U.S.A.. 75, 5765 (1978). 

The DNA template can be generated by those vectors that are either derived 
5 from bacteriophage M13 vectors (the commercially available M13mpl8 and 

M13mpl9 vectors are suitable), or tiiose vectors that contain a single-stranded phage 
origin of repUcation as described by Viera et al., Meth. Enzvmol.. 153, 3 (1987). 
Thus, the DNA that is to be mutated may be inserted into one of these vectors to 
generate single-stranded template. Production of the single-stranded template is 
10 described in Sections 4.21-4.41 of Sambrook et al., Molecular Cl oning: A 
T^boratorv Manual (Cold Spring Harbor Laboratory Press, N.Y. 1989). 

Alternatively, single-stranded DNA template may be generated by denaturing 
double-stranded plasmid (or other) DNA using standard techniques. 

For alteration of the native DNA sequence (to generate amino acid sequence 
1 5 variants, for example), the oligonucleotide is hybridized to the single-stranded 
template under suitable hybridization conditions. A DNA polymerizing enzyme, 
usually the Klenow fragment of DNA polymerase I, is then added to synthesize the 
complementary strand of the template using the oligonucleotide as a primer for 
synthesis. A heteroduplex molecule is thus formed such that one strand of DNA 
20 encodes the mutated form of AAV, and the other sfrand (the original template) 

encodes the native, unaltered sequence of AAV. This heteroduplex molecule is then 
transformed into a suitable host cell, usually a prokaryote such as E. coli JMIOI. 
After tiie cells are grown, they are plated onto agarose plates and screened using the 
oUgonucleotide primer radiolabeled with 32-phosphate to identify the bacterial 
25 colonies that contain the mutated DNA. The mutated region is then removed and 
placed in an appropriate vector, generally an expression vector of the type typically 
employed for transformation of an appropriate host. 

The method described immediately above maybe modified such that a 
homoduplex molecule is created whereui botii strands of the plasmid contain the 
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mutations(s). The modifications are as follows: The single-stranded 
oligonucleotide is annealed to the single-straaded template as described above. A 
mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), 
deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with a 
modified thiodeoxyribocytosine called dCTP-(aS) (which can be obtained from the 
Amersham Corporation). This mixture is added to the template-oUgonucleotide 
complex. Upon addition of DNA polymerase to this mixture, a strand of DNA 
identical to the template except for the mutated bases is generated. In addition, this 
new strand of DNA wiU contain dCTP-(aS) instead of dCTP, which serves to 
protect it from restriction endonuclease digestion. 

After the template strand of the double-stranded heteroduplex is nicked with 
an appropriate restriction enzyme, the template strand can be digested with ExoIII 
nuclease or another appropriate nuclease past the region that contains the site(s) to 
be mutagenized. The reaction is then stopped to leave a molecule that is only 
partially single-stranded. A complete double-stranded DNA homoduplex is then 
formed using DNA polymerase in tiie presence of all four deoxyribonucleotide 
triphosphates, ATP, and DNA ligase. This homoduplex molecule can then be 
transformed into a suitable host cell such as E. coli JMIOI. 

For example, a preferred embodiment of the invention is an isolated and 
purified DNA molecule comprising a DNA segment comprising SEQ ID N0:3, 
SEQ ID N0:4, SEQ ID N0:5, SEQ ID N0:6, a subunit thereof or a variant thereof 
having nucleotide substitutions, or deletions or insertions. 

n. Preparation of Molecules Useful to Practice the Methods of the Invention 

A. Nucleic Acid Molecules 

1. Chimeric Expression Cassettes 

To prepare expression cassettes for transformation herein, the recombinant 
or preselected DNA sequence or segment may be circular or linear, double-stranded 
or single-stranded. Generally, the preselected DNA sequence or segment is in the 
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form of chimeric DNA, such as plasmid DNA, that can also contain coding regions 
flanked by control sequences which promote the expression of the preselected DNA 
present in the resultant cell line. 

As used herein, "chimeric" means that a vector comprises DNA from at least 
5 two different species, or comprises DNA from the same species, which is linked or 
associated in a manner which does not occur in the "native" or wild type of the 
species. 

Aside from the preselected DNA sequences described above, a portion of the 
preselected DNA may serve a regulatory or a structural ftmction. For example, the 
j 1 0 preselected DNA may itself comprise a promoter that is active in mammaUan cells, 

or may utilize a promoter already present in the genome that is the transformation 
target. Such promoters include the CMV promoter, as well as the S V40 late 
promoter and retroviral LTRs (long terminal repeat elements), although many other 
promoter elements well known to the art may be employed in the practice of the 
15 invention. 

Other elements ftmctional in the host cells, such as introns, enhancers, 
polyadenylation sequences and the like, may also be a part of the preselected DNA. 
5r{ Such elements may or may not be necessary for the ftinction of the DNA, but may 

provide improved expression of the DNA by affecting transcription, stability of the 
20 mRNA, or the like. Such elements may be included in the DNA as desired to obtain 
the optimal performance of the transforming DNA in the cell. 

"Control sequences" is defmed to mean DNA sequences necessary for the 
expression of an operably Unked coding sequence in a particular host organism. The 
control sequences that are suitable for prokaryotic cells, for example, include a 
25 promoter, and optionally an operator sequence, and a ribosome binding site. 
Eukaryotic cells are known to utiHze promoters, polyadenylation signals, and 
enhancers. 

"Operably linked" is defmed to mean that the nucleic acids are placed in a 
functional relationship with another nucleic acid sequence. For example, DNA for a 
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presequence or secretory leader is operably linked to DNA for a peptide or 
polypeptide if it is expressed as a preprotein that participates in the secretion of the 
peptide or polypeptide; a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the sequence; or a ribosome binding site is 
5 operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are 
contiguous and, in the case of a secretory leader, contiguous and in reading phase. 
However, enhancers do not have to be contiguous. Linking is accompUshed by 
ligation at convenient restriction sites. If such sites do not exist, the synthetic 
N 1 0 oligonucleotide adaptors or linkers are used in accord with conventional practice. 

The preselected DNA to be introduced into the cells further will generally 
contain either a selectable marker gene or a reporter gene or both to facilitate 
identification and selection of transformed cells from the population of cells sought 
to be transformed. Alternatively, the selectable marker may be carried on a separate 
1 5 piece of DNA and used in a co-transformation procedure. Both selectable markers 
and reporter genes may be flanked with appropriate regulatory sequences to enable 
expression in the host cells. Useftil selectable markers are well known in the art and 
include, for example, antibiotic and herbicide-resistance genes, such as neo, hpt, 
dhfr, bar, aroA, dapA and the Uke. See also, the genes listed on Table 1 of 
20 Lundquist et al. (U.S. Patent No. 5,848,956). 

Reporter genes are used for identifying potentially transformed cells and for 
evaluating the functionality of regulatory sequences. Reporter genes which encode 
for easily assayable proteins are well known in the art. hi general, a reporter gene is 
a gene which is not present in or expressed by the recipient organism or tissue and 
25 which encodes a protein whose expression is manifested by some easily detectable 
property, e.g., enzymatic activity. Preferred genes uiclude the chloramphenicol 
acetyl transferase gene (cat) from Tn9 oiE. coli, the beta-glucuronidase gene (gus) 
of the uidA locus ofE. coli, and the luciferase gene from QxeOy Photiniis pyralis. 
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Expression of the reporter gene is assayed at a suitable time after the DNA has been 
introduced into the recipient cells. 

The general methods for constructing recombinant DNA which can 
transform target cells are well known to those skilled in the art, and the same 
compositions and methods of construction may be utilized to produce the DNA 
useful herein. For example, J. Sambrook et al., Molecular C loning: A Laboratory 
Manual Cold Spring Harbor Laboratory Press (2d ed., 1989), provides suitable 
methods of construction. 

2. Transformation into Host Cells 

The recombinant DNA can be readily introduced into the host cells, e.g., 
mammalian, bacterial, yeast or insect cells by transfection with an expression vector 
of the invention, by any procedure useful for the introduction into a particular cell, 
e.g., physical or biological methods, to yield a transformed cell having the 
recombinant DNA stably integrated into its genome or present as an episome which 
can persist in the transformed cells, so that the DNA molecules, sequences, or 
segments, of the present invention are maintained and/or expressed by the host cell. 

Physical methods to introduce a preselected DNA into a host cell include 
calcium phosphate precipitation, lipofection, particle bombardment, microinjection, 
electroporation, and the like. Biological methods to introduce the DNA of interest 
into a host cell include the use of DNA and RNA viral vectors. The main advantage 
of physical methods is that they are not associated with pathological or oncogenic 
processes of viruses. However, they are less precise, often resulting in multiple 
copy insertions, random integration, disruption of foreign and endogenous gene 
sequences, and unpredictable expression. 

As used herein, the term "cell line" or "host cell" is intended to refer to well- 
characterized homogenous, biologically pure populations of cells. These cells may 
be eukaryotic cells that are neoplastic or which have been "immortalized" in vitro by 
methods known in the art, as well as primary cells, or prokaryotic cells. The cell line 
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or host cell is preferably of mammalian origin, but cell lines or host cells of non- 
mammalian origin may be employed, including plant, insect, yeast, fUngal or 
bacterial sources. Generally, the preselected DNA sequence is related to a DNA 
sequence which is resident in the genome of the host cell but is not expressed, or not 
highly expressed, or, alternatively, overexpressed. 

"Transfected" or "transformed" is used herein to include any host cell or cell 
Une, the genome of which has been altered or augmented by the presence of at least 
one preselected DNA sequence, which DNA is also referred to in the art of genetic 
engineering as "heterologous DNA," "recombmant DNA," "exogenous DNA," 
"genetically engineered," "non-native," or "foreign DNA," wherein said DNA was 
isolated and introduced into the genome of the host cell or cell hne by the process of 
genetic engineering. The host cells of the present invention are typically produced 
by transfection with a DNA sequence in a plasmid expression vector, a viral 
expression vector, or as an isolated linear DNA sequence. 

To confirm the presence of the preselected DNA sequence in the host cell, a 
variety of assays may be performed. Such assays include, for example, "molecular 
biological" assays well known to those of skill in the art, such as Southern and 
Northern blotting, RT-PCR and PGR; "biochemical" assays, such as detecting the 
presence of a polypeptide expressed from a gene present in the vector, e.g., by 
immunological means (immunoprecipitations, immunoaffinity columns, ELISAs 
and Western blots) or by any other assay useful to identify molecules falling within 
the scope of the invention. 

To detect and quantitate RNA produced from introduced DNA segments, 
RT-PCR may be employed. In this appUcation of PGR, it is first necessary to 
reverse transcribe RNA into DNA, using enzymes such as reverse transcriptase, and 
then through the use of conventional PGR techniques amplify the DNA. In most 
mstances PGR techniques, while usefiil, wiU not demonstrate integrity of the RNA 
product. Further information about the nature of the RNA product may be obtained 
by Northern blotting. This technique demonstrates the presence of an RNA species 



29 



and gives information about the integrity of that RNA. The presence or absence of 
an RNA species can also be determined using dot or slot blot Northern 
hybridizations. These techniques are modifications of Northern blotting and only 
demonstrate the presence or absence of an RNA species. 

While Southern blotting and PGR may be used to detect the DNA segment in 
question, they do not provide information as to whether the DNA segment is being 
expressed. Expression may be evaluated by specifically identifying the polypeptide 
products of the introduced DNA sequences or evaluating the phenotypic changes 
brought about by the expression of the introduced DNA segment in the host cell. 

m. Dosages. Formulations and Routes of Adm inistration 

Administration of a nucleic acid molecule may be accomplished through the 
introduction of cells transformed with the nucleic acid molecule (see, for example, 
WO 93/02556), the administration of the nucleic acid molecule itself (see, for 
example, Feigner et al., U.S. Patent No. 5,580,859, PardoU et al.. Immunity, 3, 165 
(1995); Stevenson et al., Immunol. Rev.. 145, 211 (1995); MolUng, J. Mol. Med., 
75, 242 (1997); Donnelly et al., Ann. N.Y. Acad. Sci. . 772, 40 (1995); Yang et al., 
Mol. Med. Todav. 2, 476 (1996); Abdallah et al., Biol. Cell. 85, 1 (1995)), through 
infection with a recombinant virus or via liposomes. Pharmaceutical formxilations, 
dosages and routes of administration for nucleic acids are generally disclosed, for 
example, in Feigner et al., supra. 

Administration of the therapeutic agents in accordance with the present 
invention may be continuous or intermittent, depending, for example, upon the 
recipient's physiological condition, whether the purpose of the administration is 
therapeutic or prophylactic, and other factors known to skilled practitioners. The 
administration of the agents of the invention may be essentially continuous over a 
preselected period of tune or may be in a series of spaced doses. Both local and 
systemic administration is contemplated. When the molecules of the invention are 
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employed for prophylactic purposes, agents of the invention are amenable to chronic 
use, preferably by systemic administration. 

One or more suitable unit dosage forms comprising llie therapeutic agents of 
the invaition, which, as discussed below, may optionally be formulated for 
sustained release, can be administered by a variety of routes including oral, or 
parenteral, including by rectal, transdermal, subcutaneous, intravenous, 
intramuscular, intraperitoneal, intrathoracic, intrapulmonary and intranasal routes. 
The formulations may, where appropriate, be conveniently presented m discrete unit 
dosage forms and may be prepared by any of the methods well known to pharmacy. 
Such methods may include the step of bringing into association the therapeutic agent 
with liquid carriers, soUd matrices, semi-solid carriers, finely divided sohd carriers 
or combinations thereof, and then, if necessary, introducing or shapmg the product 
into the desired delivery system. 

When the ther^eutic agents of the invention are prepared for oral 
administration, they are preferably combined with a pharmaceutically acceptable 
carrier, diluent or excipient to form a pharmaceutical formulation, or unit dosage 
form. The total active ingredients in such formulations comprise from 0.1 to 99.9% 
by weight of the formulation. By "pharmaceutically acceptable" it is meant the 
carrier, diluent, excipient, and/or salt must be compatible with the other ingredients 
of the formulation, and not deleterious to the recipient thereof The active ingredient 
for oral administration may be present as a powder or as granules; as a solution, a 
suspension or an emulsion; or in achievable base such as a synthetic resin for 
ingestion of the active ingredients from a chewing gum. The active ingredient may 
also be presented as a bolus, electuary or paste. 

Pharmaceutical formulations containing the ther^eutic agents of the 
invention can be prepared by procedures known in the art using well known and 
readily available ingrediaits. For example, tiie agent can be formulated with 
common excipients, diluents, or carriers, and formed into tablets, capsules, 
suspensions, powders, and the like. Examples of excipients, diluents, and carriers 
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that are suitable for such formulations include the following fillers and extenders 
such as starch, sugars, mannitoi, and silicic derivatives; binding agents such as 
carboxymethyl cellulose, HPMC and other cellulose derivatives, alginates, gelatin, 
and polyvinyl-pyrrohdone; moisturizing agents such as glycerol; disintegrating 
agents such as calcium carbonate and sodium bicarbonate; agents for retarding 
dissolution such as paraffin; resorption accelerators such as quaternary ammonium 
compounds; surface active agents such as cetyl alcohol, glycerol monostearate; 
adsoiptive carriers such as kaolin and bentonite; and lubricants such as talc, calcium 
and magnesium stearate, and soUd polyethyl glycols. 

For example, tablets or caplets contaming the agents of the invention can 
include buffering agents such as calcium carbonate, magnesium oxide and 
magnesium carbonate, Caplets and tablets can also include inactive ingredients such 
as cellulose, pregelatinized starch, silicon dioxide, hydroxy propyl methyl cellulose, 
magnesium stearate, microcrystalhne cellulose, starch, talc, titanium dioxide, 
benzoic acid, citric acid, com starch, mineral oil, polypropylene glycol, sodium 
phosphate, and zinc stearate, and the like. Hard or soft gelatin capsules containing 
an agent of the invention can contain inactive ingredients such as gelatin, 
microcrystalline cellulose, sodium laxxryl sulfate, starch, talc, and titanium dioxide, 
and the like, as well as liquid vehicles such as polyethylene glycols (PEGs) and 
vegetable oil. Moreover, enteric coated caplets or tablets of an agent of the invention 
are designed to resist disintegration in the stomach and dissolve in the more neutral 
to alkaline environment of the duodenum. 

The therapeutic agents of the invention can also be formulated as elixirs or 
solutions for convenient oral administration or as solutions appropriate for 
parenteral administration, for instance by intramuscular, subcutaneous or 
intravenous routes. 

The pharmaceutical formulations of the therapeutic agents of the invention 
can also take the form of an aqueous or anhydrous solution or dispersion, or 
altematively the form of an emulsion or suspension. 
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Thus, the therapeutic agent maybe formulated for parenteral administration 
(e.g., by injection, for example, bolus injection or continuous infusion) and may be 
presented in unit dose form in ampules, pre-fiUed syringes, small volume infusion 
containers or in multi-dose containers with an added preservative. The active 
ingredients may take such forms as suspensions, solutions, or emulsions in oily or 
aqueous vehicles, and may contain formulatory agents such as suspending, 
stabilizing and/or dispersing agents. Altematively, the active ingredients may be in 
powder form, obtained by aseptic isolation of sterile solid or by lyophilization from 
solution, for constitution with a suitable vehicle, e.g., sterile, pyrogen-free water, 
before use. 

These formulations can contain pharmaceutically acceptable vehicles and 
adjuvants which are well known in the prior art. It is possible, for example, to 
prepare solutions using one or more organic solvent(s) that is/are acceptable from 
the physiological standpoint, chosen, in addition to water, from solvents such as 
acetone, ethanol, isopropyl alcohol, glycol ethers such as the products sold under the 
name "Dowanol", polyglycols and polyethylene glycols, C1-C4 alkyl esters of short- 
chain acids, preferably ethyl or isopropyl lactate, fatty acid triglycerides such as the 
products marketed under the name "Miglyol", isopropyl myristate, animal, mineral 
and vegetable oils and polysiloxanes. 

The compositions according to the invention can also contain thickening 
agents such as cellulose and/or cellulose derivatives. They can also contain gums 
such as xanthan, guar or carbo gum or gum arable, or altematively polyethylene 
glycols, bentones and montmorillonites, and the like. 

It is possible to add, if necessary, an adjuvant chosen from antioxidants, 
surfactants, other preservatives, film-forming, keratolytic or comedolytic agents, 
perfumes and colorings. Also, other active ingredients may be added, whether for 
the conditions described or some other condition. 

For example, among antioxidants, t-butylhydroquinone, butylated 
hydroxyanisole, butylated hydroxytoluene and a-tocopherol and its derivatives may 
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be mentioned. The galenical forms chiefly conditioned for topical appUcation take 
the form of creams, milks, gels, dispersion or microemulsions, lotions thickened to a 
greater or lesser extent, impregnated pads, ointments or sticks, or alternatively the 
form of aerosol formulations in spray or foam form or alternatively in the form of a 
cake of soap. 

Additionally, the agents are well sidted to formulation as sustained release 
dosage forms and the like. The formulations can be so constituted that they release 
the active ingredient only or preferably in a particular part of the intestinal or 
respiratory tract, possibly over a period of time. The coatings, envelopes, and 
protective matrices may be made, for example, from polymeric substances, such as 
polylactide-glycolates, hposomes, microemulsions, microparticles, nanoparticles, or 
waxes. These coatings, envelopes, and protective matrices are useful to coat 
indwelUng devices, e.g., stents, catheters, peritoneal dialysis tubing, and the like. 

The therapeutic agents of the invention can be dehvered via patches for 
h-ansdermal administration. See U.S. Patent No. 5,560,922 for examples of patches 
suitable for transdermal delivery of a therapeutic agent. Patches for transdermal 
delivery can comprise a backing layer and a polymer matrix which has dispersed or 
dissolved therein a therapeutic agent, along with one or more skin permeation 
enhancers. The backing layer can be made of any suitable material which is 
impermeable to the therapeutic agent. The backing layer serves as a protective 
cover for the matrix layer and provides also a support function. The backing can be 
formed so that it is essentially the same size layer as the polymer matrix or it can be 
of larger dimension so that it can extend beyond the side of the polymer matrix or 
overlay the side or sides of the polymer matrix and then can extend outwardly in a 
manner that the surface of the extension of the backing layer can be the base for an 
adhesive means. Alternatively, the polymer matrix can contain, or be formulated of, 
an adhesive polymer, such as polyacrylate or acrylate/vinyl acetate copolymer. For 
long-term applications it might be desirable to use microporous and/or breathable 
backing laminates, so hydration or maceration of the skin can be minimized. 
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Examples of materials suitable for making the backing layer are films of high 
and low density polyethylene, polypropylene, polyurethane, polyvinylchloride, poly- 
esters such as poly(ethylene phthalate), metal foils, metal foil laminates of such 
suitable polymer fihns, and the hke. Preferably, the materials used for the backing 
layer are laminates of such polymer fihns with a metal foil such as aluminum foil. 
In such laminates, a polymer fihn of the laminate will usually be in contact with the 

adhesive polymer matrix. 

The backing layer can be any appropriate thickness which will provide the 
desired protective aud support functions, A suitable thickness will be firom about 10 
to about 200 microns. 

Generally, those polymers used to form the biologically acceptable adhesive 
polymer layer are those capable of forming shaped bodies, thin walls or coatings 
through which therapeutic agents can pass at a controlled rate. Suitable polymers 
are biologically and pharmaceutically compatible, nonallergenic and insoluble in 
and compatible with body fluids or tissues with which the device is contacted. The 
use of soluble polymers is to be avoided since dissolution or erosion of the matrix by 
skin moisture would affect the release rate of the therapeutic agents as well as the 
capability of the dosage unit to remain in place for convenience of removal. 

Exemplary materials for fabricating the adhesive polymer layer include 
polyethylene, polypropylene, polyurethane, ethylene/propylene copolymers, 
ethylene/ethylacrylate copolymers, ethylene/vinyl acetate copolymers, siUcone 
elastomers, especially the medical-grade polydimethylsiloxanes, neoprene rubber, 
polyisobutylene, polyacrylates, chlorinated polyethylene, polyvinyl chloride, vinyl 
chloride-vinyl acetate copolymer, crosslinked polymethacrylate polymers (hydro- 
gel), polyvinylidene chloride, poly(ethylene terephthalate), butyl rubber, 
epichlorohydrin rubbers, ethylenvinyl alcohol copolymers, ethylene-vinyloxyethanol 
copolymers; silicone copolymers, for example, polysiloxane-polycarbonate 
copolymers, polysiloxanepolyethylene oxide copolymers, polysiloxane- 
polymethacrylate copolymers, polysiloxane-alkylene copolymers (e.g., polysiloxane- 
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ethylene copolymers), polysiloxane-alkylenesilane copolymers (e.g., polysiloxane- 
ethylenesilane copolymers), and the like; cellulose polymers, for example methyl or 
ethyl cellulose, hydroxy propyl methyl cellulose, and cellulose esters; 
polycarbonates; polytetrafluoroethylene; and the Uke. 

Preferably, a biologically acceptable adhesive polymer matrix should be 
selected from polymers with glass transition temperatures below room temperature. 
The polymer may, but need not necessarily, have a degree of crystalhnity at room 
temperature. Cross-linking monomeric units or sites can be incorporated into such 
polymers. For example, cross-linking monomers can be incorporated into 
polyacrylate polymers, which provide sites for cross-linking the matrix after 
dispersing the therapeutic agent into the polymer. Known cross-Unking monomers 
for polyacrylate polymers include polymethacrylic esters of polyols such as butylene 
diacrylate and dimethacrylate, trimethylol propane trimethacrylate and the like. 
Other monomers which provide such sites include allyl acrylate, allyl methacrylate, 
diallyl maleate and the like. 

Preferably, a plasticizer and/or humectant is dispersed within the adhesive 
polymer matrix. Water-soluble polyols are generally suitable for this purpose. 
Incorporation of a humectant in the formulation allows the dosage unit to absorb 
moisture on the surface of skin which in turn helps to reduce skin irritation and to 
prevent the adhesive polymer layer of the delivery system from failing. 

Therapeutic agents released from a transdermal delivery system must be 
capable of penetrating each layer of skin. In order to increase the rate of permeation 
of a therapeutic agent, a transdermal drug delivery system must be able in particular 
to increase the permeabiUty of the outermost layer of skin, the stratum comeum, 
which provides the most resistmce to the penetration of molecules. The fabrication 
of patches for transdermal delivery of therapeutic agents is well known to the art. 

For administration to the upper (nasal) or lower respiratory tract by 
inhalation, the therapeutic agents of the invention are conveniently dehvered from 
an insufflator, nebuUzer or a pressurized pack or other convenient means of 
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delivering an aerosol spray. Pressurized packs may comprise a suitable propellant 
such as dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, 
carbon dioxide or other suitable gas. hi the case of a pressurized aerosol, the dosage 
unit may be determined by providing a valve to dehver a metered amount. 

5 Alternatively, for administration by inhalation or insufflation, the 

composition may take the form of a dry powder, for example, a powder mix of the 
therapeutic agent and a suitable powder base such as lactose or starch. The powder 
composition may be presented in unit dosage form in, for example, capsules or 
cartridges, or, e.g., gelatme or bUster packs from which the powder may be 

1 0 administered with the aid of an inhalator, msufflator or a metered-dose inhaler. 

For intra-nasal administration, the therapeutic agent maybe administered via 
nose drops, a liquid spray, such as via a plastic bottle atomizer or metered-dose 
inhaler. Typical of atomizers are the Mistometer (Wintrop) and tiie Medihaler 
(Riker). 

1 5 The local delivery of the therapeutic agents of the invention can also be by a 

variety of techniques which administer the agent at or near the site of disease. 
Examples of site-specific or targeted local dehvery techniques are not intended to be 
limiting but to be illustrative of the techniques available. Examples mclude local 
delivery catheters, such as an infusion or indwelling catheter, e.g., a needle infusion 

20 catheter, shunts and stents or other implantable devices, site specific carriers, direct 
injection, or direct applications. 

For topical administration, the therapeutic agents maybe formulated as is 
known in the art for direct application to a target area. Conventional forms for this 
purpose include wound dressmgs, coated bandages or other polymer coverings, 

25 ointments, creams, lotions, pastes, jeUies, sprays, and aerosols. Ointinents and 
creams may, for example, be formulated with an aqueous or oily base with the 
addition of suitable thickening and/or gelling agents. Lotions may be formulated 
with an aqueous or oily base and will in general also contain one or more 
emulsifying agents, stabilizing agents, dispersing agents, suspendmg agents. 
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thickening agents, or coloring agents. The active ingredients can also be delivered 
via iontophoresis, e.g., as disclosed in U.S. Patent Nos, 4,140,122; 4,383,529; or 
4,05 1,842. The percent by weight of a therapeutic agent of the invention present in 
a topical formulation will depend on various factors, but generally will be from 
0.01% to 95% of the total weight of the formulation, and typically 0.1-25% by 
weight. 

Drops, such as eye drops or nose drops, may be formulated with an aqueous 
or non-aqueous base also comprising one or more dispersing agents, solubilizing 
agents or suspending agents. Liquid sprays are conveniently deUvered from 
pressurized packs. Drops can be delivered via a simple eye dropper-capped bottle, 
or via a plastic bottle adapted to deliver liquid contents dropwise, via a specially 
shaped closure. 

The therapeutic agent may ftirther be formulated for topical administration in 
the mouth or throat. For example, the active ingredients may be formulated as a 
lozenge further comprising a flavored base, usually sucrose and acacia or tragacanth; 
pastilles comprising the composition in an inert base such as gelatin and glycerin or 
sucrose and acacia; and mouthwashes comprising the composition of the present 
invention in a suitable liquid carrier. 

The formulations and compositions described herein may also contain other 
ingredients such as antimicrobial agents, or preservatives. Furthermore, the active 
ingredients may also be used in combination with other therapeutic agents, for 
example, bronchodilators. 

In particular, for delivery of a vector of the invention to a tissue such as 
muscle, any physical or biological method that will introduce the vector into the 
muscle tissue of a host animal can be employed. Vector means both a bare 
recombinant vector and vector DNA packaged into viral coat proteins, as is well 
known for AAV administration. Simply dissolving an AAV vector in phosphate 
buffered saline has been demonstrated to be sufficient to provide a vehicle useful for 
muscle tissue expression, and there are no known restrictions on the carriers or other 
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components that can be coadministered with the vector (although compositions that 
degrade DNA should be avoided in the normal manner with vectors). 
Pharmaceutical compositions can be prepared as injectable formulations or as 
topical formulations to be delivered to the muscles by transdermal transport. 
5 Numerous formulations for both intramuscular injection and transdermal transport 
have been previously developed and can be used in the practice of the invention. 
The vectors can be used with any pharmaceutically acceptable carrier for ease of 
administration and handling. 

For purposes of intramuscular injection, solutions in an adjuvant such as 
10 sesame or peanut oil or in aqueous propylene glycol can be employed, as well as 
sterile aqueous solutions. Such aqueous solutions can be buffered, if desired, and 
^ the Uquid diluent first rendered isotonic with saline or glucose. Solutions of the 

AAV vector as a firee acid (DNA contains acidic phosphate groups) or a 
pharmacologically acceptable salt can be prepared in water suitably mixed with a 
15 surfactant such as hydroxypropylcellulose. A dispersion of AAV viral particles can 
also be prepared in glycerol, liquid polyethylene glycols and mixtures thereof and in 
oils. Under ordinary conditions of storage and use, these preparations contain a 
ly preservative to prevent the growth of microorganisms. In this connection, the sterile 

aqueous media employed are all readily obtainable by standard techniques well- 
20 known to those skilled in the art. 

The pharmaceutical forms suitable for injectable use include sterile aqueous 
solutions or dispersions and sterile powders for the extemporaneous preparation of 
sterile injectable solutions or dispersions. In all cases the form must be sterile and 
must be fluid to the extent that easy syringability exists. It must be stable under the 
25 conditions of manufacture and storage and must be preserved against the 

contaminating action of microorganisms such as bacteria and fungi. The carrier can 
be a solvent or dispersion medium containing, for example, water, ethanol, polyol 
(for example, glycerol, propylene glycol, liquid polyethylene glycol and the like), 
suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, 
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for example, by the use of a coating such as lecithin, by the maintenance of the 
required particle size in the case of a dispersion and by the use of surfactants. The 
prevention of the action of microorganisms can be brought about by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
sorbic acid, thimerosal and the like, hi many cases it will be preferable to include 
isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of 
the injectable compositions can be brought about by use of agents delaying 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the AAV vector in 
the required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by filtered sterilization. Generally, 
dispersions are prepared by incorporating the sterilized active ingredient into a 
sterile vehicle which contains the basic dispersion medium and the required other 
ingredients firom those enumerated above. Li the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are 
vacuum drying and the freeze drying technique which yield a powder of the active 
ingredient plus any additional desired ingredient from the previously sterile-filtered 
solution thereof 

For purposes of topical administration, dilute sterile, aqueous solutions 
(usually in about 0.1% to 5% concentration), otherwise similar to the above 
parenteral solutions, are prepared in containers suitable for incorporation into a 
transdermal patch, and can include known carriers, such as pharmaceutical grade 
dimethylsulfoxide (DMSO). 

The therapeutic compounds of this invention may be administered to a 
mammal alone or in combination with pharmaceutically acceptable carriers. As 
noted above, the relative proportions of active ingredient and carrier are determined 
by the solubility and chemical nature of the compound, chosen route of 
administration and standard pharmaceutical practice. 
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The dosage of the present therapeutic agents which will be most suitable for 
prophylaxis or treatment will vary with the form of administration, the particular 
compound chosen and the physiological characteristics of the particular patient 
under treatment. Generally, small dosages will be used initially and, if necessary, 
will be increased by small increments until the optimum effect under the 
circumstances is reached. Exemplary dosages are set out in the example below. 

Since AAV has been shown to have a broad host range (for pulmonary 
expression) and persists in muscle, the vectors of the invention maybe employed to 
express a gene in any animal, and particularly in mammals, birds, fish, and reptiles, 
especially domesticated mammals and birds such as cattle, sheep, pigs, horses, dogs, 
cats, chickens, and turkeys. Both human and veterinary uses are particularly 
preferred. 

The gene being expressed can be either a DNA segment encoding a protein, 
with whatever control elements (e.g., promoters, operators) are desired by the user, 
or a non-coding DNA segment, the transcription of which produces all or part of 
some RNA-containing molecule (such as a transcription control element, +RNA, or 
anti-sense molecule). 

Muscle tissue is a very attractive target for in vivo gene deUvery and gene 
therapy, because it is not a vital organ and is very easy to access. If a disease is 
caused by a defective gene product which is required to be produced and/or secreted, 
such as hemophiha, diabetes and Gaucher' s disease, and the like, is muscle is a good 
cmdidate to supply the gene product if the appropriate gene can be effectively 
delivered into the cells. 

Different vectors, such as naked DNA, adenovirus and retrovirus, have been 
utilized to directly deliver various transgenes into muscle tissues. However, neither 
system can offer both high efficiency and long-term expression. For n^ed plasmid 
DNA directly delivered into muscle tissue, the efficiency is not high. There are only 
a few cells near the injection site that can maintain transgene expression. 
Furthermore, the plasmid DNA in the cells remains as non-replicating episomes, i.e., 
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in the unintegrated form. Therefore, it will be eventually lost. For adenovirus 
vector, it can infect the non-dividing cells, and therefore, can be directly delivered 
into the mature tissues such as muscle. However, the transgene delivered by 
adenovirus vectors are not useful to maintain long-term expression for the following 
reasons. First, since adenovirus vectors still retain most of the viral genes, they are 
not very safe. Moreover, the expression of those genes can cause the immune 
system to destroy the cells containing the vectors (see, for example, Yang et al. 
1994, Proc. Nath Acad. Sci. 91:4407-441 1), Second, since adenovirus is not an 
integration virus, its DNA will eventually be diluted or degraded in the cells. Third, 
due to the immune response, adenovirus vector could not be repeatedly delivered. 
In the case of lifetime diseases, this will be a major limitation. For retrovirus 
vectors, although they can achieve stable integration into the host chromosomes, 
their use is very restricted because they can only infect dividing cells while a large 
majority of the muscle cells are non-dividing. 

Adeno-associated virus vectors have certain advantages over the above- 
mentioned vector systems. First, like adenovirus, AAV can efficiently infect non- 
dividing cells. Second, all the AAV viral genes are eliminated in the vector. Since 
the viral-gene-expression-induced immune reaction is no longer a concern, AAV 
vectors are safer than Ad vectors. Thirds, AAV is an integration virus by nature, 
and integration into the host chromosome will stably maintain its transgene in the 
cells. Fourth, AAV is an extremely stable virus, which is resistant to many 
detergents, pH changes and heat (stable at 56'C for more than an hour). It can be 
lyophilized and redissolved without losing its activity. Therefore, it is a very 
promising delivery vehicle for gene therapy. 

The invention will be further described by, but is not limited to, the 
following examples. 
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Example 1 

Materials and Methods 
Construction of r AAV Shuttle Vector. 

A recombinant AAV shuttle vector (AV.GFPSori) which contained a GFP 
transgene cassette, bacterial ampicillin resistance gene, and bacterial origin of 
replication, was generated from a cw-acting plasmid (pCisAV.GFPSori). 
Expression of the GFP gene was directed by the CMV promoter/enhancer and SV40 
poly-adenylation sequences. pCisAV.GFP3ori was constructed with pSub201 
derived ITR elements (Samulski et al., 1987) and the intactness of ITR sequences 
was confirmed by restriction analysis with Smal and PvuII, and by sequencing. 
Recombinant AAV stocks were generated by co-transfection of pCisAV.GFPBori 
and pRep/Cap together with co-infection of recombinant Ad.CMVlacZ in 293 cells 
(Duan et al., 1997). Following transfection of forty 150 mm plates, cells were 
collected at 72 hours by centrifugation and resuspended in 12 ml of buffer (10 mM 
Tris pH 8.0). Virus was released from cells by three cycles of freeze/thawing and 
passaged through a 25 gauge needle six times. Cell lysates were then treated with 
1.3 mg/ml DNase I at 37°C for 30 minutes and 1% deoxycholate (g/ml final) and 
0.05% trypsin (g/ml final) at 37°C for 30 minutes. Samples were then placed on ice 
for 10 minutes and centriftiged to remove large particulate material at 3,000 rpm for 
30 minutes. 

rAAV was purified by isopycnic density gradient centrifugation in CsCl 
(r=1.4) in a SW55 rotor for 72 hours at 35K. Peak fractions of AAV were combined 
and re-purified through two more rounds of CsCl centrifugation, followed by 
heating at 58°C for 60 minutes to inactivate all contaminant helper adenovirus. 
Typically, this preparation gave approximate AAV titers of 10'^ DNA molecules/ml 
and 2.5 x 10^ GFP-expressing vmits/ml. Recombinant viral titers were assessed by 
slot blot and quantified against pCisAV.GFP3ori controls for DNA particles, 
Fimctional transducing units were quantified by GFP transgene expression in 293 
cells. The absence of helper adenovirus was confirmed by histochemical staining of 
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rAAV infected 293 cells for beta-galactosidase, and no recombinant adenovirus was 
found in 10^^ particles of purified rAAV stocks. The absence of significant wtAAV 
contamination was confirmed by immunocytochemical staining of r AAV/Ad 
co-infected 293 cells with anti-Rep antibodies. These studies, which had a 
sensitivity of 1 wtAAV in 10^^ rAAV particles, demonstrated an absence of Rep 
staining as compared to pRep/Cap plasmid transfected controls. 
Isolation and Structural Evaluation of AAV Circular Intermediates From Hela Cells. 

Hela cells were grown in 35 mm dishes in DMEM media supplemented with 
10% fetal calf serum (FCS). Cells were infected in the presence of 2% FCS at 80% 
confluency with recombinant AV.GFP3ori (MOI=1000 particles/cell, 1x10^ total 
particles/plate) and Hirt DNAs isolated as described by Duan et al. (1997) at 6, 12, 
24, 48, and 72 hours post-infection. In experiments analyzing the effects of 
adenovirus, plates were co-infected with Ad.CMVLacZ (MOI=5000 particles/cell) 
in the presence of 2%FCS/DMEM. Zero hour controls were generated by mixing 
10^ particles of AV,GFP3ori with cell lysates prior to Hirt DNA preparation, Hirt 
DNA isolated at each time point was used to transform E, coli SURE cells 
(Stratagene, La JoUa, CA.). Typically, 1/10 of the Hirt DNA preparation was used 
to transform 40 ml of competent bacteria by electroporation. The resultant total 
number of bacterial colonies was quantified for each time point and the structure of 
circular intermediates was evaluated for greater than 20 plasmid clones for each time 
point from two independent experiments. Structural determinations were based on 
restriction enzyme analysis using PstI, SphI, Asel single and double digests together 
with Southern blotting against GFP, stuffer, and ITR probes. 
Evaluation of E2a and GFP gene expression in Hela cells. 

E2a gene expression was evaluated by immunofluorescent staining of Hela 
cells superinfected with El-deleted Ad.CMVlacZ (MOI= 0, 500, 5000 
particles/cell). Briefly, cells were fixed in methanol at -20°C for 10 minutes 
followed by air drying. Cells were then incubated at room temperature with 
hybridoma supematant against Ad5 72kd DBF (Reich et al., 1983), followed by goat 
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anti-mouse-FITC antibody (5 mg/ml) for 30 minutes at room temperature. In 
studies evaluating augmentation of AAV GFP transgene expression by adenovirus, 
Hela cells were harvested at 24 or 72 hours post-infection by trypsinization, 
resuspended in 2%FCS/PBS and evaluated by FACS analyses. Thresholds were set 
using uninfected controls and the percentage and/or the average relative fluorescent 
intensity was determined by sorting greater than 10^ cells per experiment condition. 
Sequence Analvsis of AAV Circular Intermediates. 

Sequence analj^is of the ITR array within circular intermediates was 
performed using primers EL118 (5'-CGGGGGTCGTTGGGCGGTCA-3'; SEQ ID 
N0:1) and EL230 (5'-GGGCGGAGCCTATGGAAAA-3'; SEQ ID N0:2) which 
are nested to 5 ' and 3 ' ITR sequences, respectively. Both circular and linearized 
(with Sm^ which cuts within ITR sequences) plasmids were sequenced. 
Results 

Construction of rAAV Shuttle Vector and Isolation of C ircular Intermediates. 

To circumvent the inability to retrieve pre-integration intermediates or as 
stable episomal forms resistant to nuclease digestion, an alternative strategy was 
developed to "trap" circular intermediates using a recombinant AAV shuttle vector. 
Recombinant AV.GFP3ori virus (Figure IB) was generated from a cis-acting 
plasmid (pCisAV.GFP3ori, Figure 1 A) by co-transfection in 293 cells with 
trans-acting plasmids encoding Rep and Cap viral genes. This viral vector 
(AV.GFP3ori) encoded the green fluorescent protein (GFP) reporter gene, a 
bacterial origin of repUcation (ori), and the bacterial ampicillin-resistance gene. Ori 
and ampicillin-resistance sequences encoded in this virus allow for the rescue of 
circular AAV genomes formed during the transduction process. 

To test this strategy, Hela cells were infected with AV.GFP3ori (MOI=1000 
particles/cell) and the abundance of circular intermediates was evaluated following 
transformation of low molecular weight cellular Hirt DNA into E. coli SURE ceUs. 
The presence of cucular intermediates was inferred by retrievable 
ampicillin-resistant bacterial colonies. Structural features of circular intermediates 
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were determined by restriction enzyme analysis and Southern blotting with various 
regions of the provirus, including GFP, Stuffer, and ITR sequences. 

The predominant circular form isolated after transduction of Hela cells with 
AV.GFPSori consisted of 4.7 kb monomer-sized molecules (Figure IC). SphI 
digestions of these circular intermediates yielded characteristic 300 bp bands which 
hybridized to an ITR probe on Southern blots (Figure 2A). PstI, SphI, Asel single 
and double digests together with Southern blot analysis using GFP, Stuffer (data not 
shown), and ITR (Figure 2A) probes confirmed the structure of the circular 
intermediates as head-to-tail monomer genomes (Figure IC). In particular, PstI 
digests together with ITR Southern blots distinguish these head-to-tail circular 
intermediates jfrom head-to-head or tail-to-tail circular dimers. Similar results 
obtained with studies on AV.GFPSori infected 293 cells and primary fibroblasts 
have confirmed that monomer head-to-tail circular intermediates were also the most 
abundant form in these cell types. 

Because the predicted molecular weight of an intact head-to-tail ITR SphI 
firagment would be approximately 360 bp, an anomalous migration in agarose gels 
might be due to the high secondary structure of inverted repeats within ITRs. To 
this end, the head-to-tail orientation of the ITRs, as predicted by Southem blot 
analysis, was confirmed using several sequencing strategies. First, the SphI ITR 
hybridizing fragment of a circular intermediates was subcloned into a secondary 
plasmid vector and sequenced with primers outside the ITR cloned sequences. 
These findings confirmed the head-to-tail orientation of ITRs. Additionally, 
sequence was obtained directly firom six monomer circular ititermediate clones using 
primers internal to both the 5' and 3' ITRs (Figure 2C). In these studies, circular 
intermediates were digested with Smal and the linear 4.6 kb plasmid was gel 
isolated prior to sequencing. Smal digestion (which relaxed the secondary structure 
of ITRs) was necessary to obtain sequence information within the ITRs. The 
sequencing results presented in Figure 2C confirmed the orientation of head-to-tail 
ITR arrays in these intermediates. 
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Interestingly, sequencing also revealed several consistent base pair (bp) 
changes in four ofthe six clones analyzed (Figure 2C). These four clones (p79, 
p81, p87, and p88) had consistent two bp changes within the D-sequence [G->A 
(122bp) and A->G (125bp)], which always occurred together with the bp alterations 
in the p5 promoter [A->G (1 14bp) and A->C (1 15bp)]. No other consistent bp 
changes were noted except for two clones (p79 and p88) which demonstrated 
mutations just outside the 3'ITR D-sequence [T->G (381bp) and T->C (383bp)]. 

Although head-to-tail circular intermediates were the most abimdant forms 
present in Hirt DNA from rAAV infected Hela cells, several less frequent structures 
were also detected. These included monomer circularized AAV genomes with one 
(pi 90) and three ITRs (p345) arranged in a head-to-tail fashion as well as several 
clones with an unknown structure lacking complete ITRs (p340) (Figure 2A). Such 
diversity within the ITR array may represent homologous recombination in vivo or 
in bacteria during ampUfication. However, previous studies demonstrating similar 
variations in ITR sequences of head-to-tail integrated genomes, suggest that such 
changes in the length of the ITR array may occur in vivo (Duan et al., 1997) 
Additionally, less frequent head-to-tail ckcularized multimer forms were predicted 
based on the variation in migration pattems of uncut plasmids which gave identical 
restriction pattems. Results shown in Figure 2B confirmed the existence of 
monomer and dimer head-to-tail circular intermediates using partial digestion with 
an enzyme which cuts once in the AAV genome (Asel). Cumulative analysis of 
greater than 200 independently isolated circular intermediates from Hela cells 
demonstrated that head-to-tail circular AAV genomes occurred in greatest 
abundance as monomers (92%) and less frequently as multimers of greater than one 
genome (8%). 

To establish that head-to-tail circular intermediates were formed in vivo and 
not by non-specific bacterial recombination of hnear AAV genomes present in the 
Hirt DNA, a set of reconstitution experiments was performed by which the same 
number of rAAV particles used for infection experiments were spiked into Hela cell 
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lysates prior to Hirt preparations. In these studies, background bacterial 
amplification of Hirt DNA spiked with linear rAAV genomes was negligible 
(Figure 3D) and of the few isolated colonies obtained from these controls, none had 
a predicted head-to-tail structure as assessed by Southern blot restriction enzyme 
analysis (Figure 3E). Additionally, reconstitution experiments transforming 
bacterial with linearized dsDNA AAV genomes did not give rise to significant 
levels of replication competent plasmids or the characteristic head-to-tail structure 
associated with AAV circular intermediates. These findings confirm that circular 
intermediates do not likely arise fi:om non-specific recombination or ligation events 
with either ssDNA or dsDNA linear AAV genomes in bacteria. Additional control 
experiments, demonstrating the lack of stuffer hybridizing sequences in AAV 
circular intermediates by Southern blotting, also confirm that these structures do not 
arise fi-om contamination of viral stocks with pCisAV.GFPSori plasmid. 
The formation of head-to-tail circular AAV intermediates is augmented by 
superinfection with El -deleted adenovirus. 

Many aspects of the wtAAV growth cycle are affected by helper adenovirus, 
including AAV DNA replication, transcription, splicing, translation, and virion 
assembly. Such studies have provided concrete evidence that a subset of Ad early 
gene products provide helper functions for the wtAAV lytic cycle, including: El a, 
Elb, E2a, E4 0RF6 and VAl RNA (Muzyczka, 1992). In this regard, one of the 
most critical factors which is required for AAV repUcation is the 34 kD E4 protein 
(0RF6). Recent observations on the helper fimction of Ad in rAAV transduction 
have also demonstrated that Ad E4 0RF6 is essential for the augmentation of rAAV 
transgene expression seen with adenovirus co-infection (Ferrari et al., 1996; Fisher 
et al., 1996). According to these reports, the rate-limiting step enhanced by these 
adenoviral proteins is the conversion of single stranded AAV genomes to double 
stranded forms. 

Studies evaluating the kinetics of rAAV circular intermediate formation 
demonstrated a time-dependent increase in abundance which peaked at 24 hours 
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post-infection in Hela cells and coincided with the onset of GFP transgene 
expression (Figure 3). To better understand the cellular mechanisms associated with 
AAV circular intermediate formation, the effects of adenoviral co-infection on this 
process were evaluated. The extent of transgene expression and circular 
intermediate formation in AV.GFPSori infected Hela cells with or without 
co-infection with El -deleted recombinant adenovirus was compared. 

Although El -deleted adenoviruses are severely handicapped in their ability 
to synthesize viral gene products, at high MOIs of >5000 significant E2a protein 
expression was noted (Figure 3 A). As an indicator of transgene expression, the 
abundance and average relative intensity of GFP positive cells was determined 
against mock infected controls by fluorescent microscopy (Figure 3B) and FACS 
analysis (Figure 3C) at 72 hours post-infection. In accord with previous reports 
demonstrating augmentation in rAAV transgene expression by adenovirus (Ferrari et 
al., 1996; Fisher et al., 1996), the extent of GFP transgene expression was 
dramatically increased at doses of adenovirus which led to viral gene expression 
(MOI>5000; Figures 3A-C). Additionally, persistence of rAAV transgene 
expression was also augmented by co-infection with El -deleted adenovirus, as 
determined by GFP-expressing colony formation following serial passages 
(Figure 3C). 

If circular intermediates represent a molecular form of rAAV important for 
efficient and/or persistent transgene expression, augmentation of rAAV transgene 
expression by adenovirus might also modulate circular intermediate formation. In 
these studies, the abundance and time course of AAV circular intermediate 
formation was evaluated following superinfection with Ad.CMVLacZ. Results from 
these experiments are shown in Figure 3D, which represents the total number of 
bacterial colonies (per 35 mm plate) obtained following transformation of E. coli 
with Hirt DNA isolated from Hela cells infected with AV.GFPSori (1000 DNA 
particles/cell) with or without co-infection with Ad.CMVlacZ (5,000 particles/cell). 
An MOI of 5000 Ad particles/cell was chosen for these experiments since this level 
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of adenovirus led to minimal cytopathic effect (CPE) with high levels of E2a 
expression. 

These studies demonstrated a nearly 2-fold augmentation by Ad.CMVLacZ 
in the total abundance of AAV rescued plasmid intermediates in E. coli (Figure 3D). 
Southern blot restriction enzyme analysis demonstrated that the predominant forms 
in both the presence and absence of adenovirus were head-to-tail monomer curcular 
intermediates containing the diagnostic 300 bp ITR fragment following SphI 
digestion (Figure BE). Additionally, results demonstrated that adenovirus 
co-infection led to an earlier time of onset and increased stability of AAV 
head-to-tail monomer circular intermediates (Figures 3E and F). For example, at 
6 hours post-infection, head-to-tail circular intermediates were only present in Hela 
cells co-infected with adenovirus. Furthermore, a decline in the percentage of 
head-to-tail circular intermediate clones was seen at 48-72 hours post-AAV 
infection in the absence of adenovirus, hi contrast, this decline was significantly 
blunted by the presence of helper adenovirus (Figure 3F). Based on these findings, 
it was concluded that certain adenoviral proteins produced by superinfection with 
El-deleted adenovirus were capable of modulating circular intermediates formation 
and stability during rAAV transduction. 
Discussion 

In the present study, it was shown that circularization of Hnear AAV 
genomes occurs during rAAV transduction. Circularization appears to 
predominately occur as head-to-tail monomer genomes. However, the existence of 
less abundant circular multimer forms suggests that recombinational events 
subsequent to the initial infection may drive concatamerization of circular genomes. 
The diversity in the length of ITR arrays found within circular intermediates (i.e., 
1-3 ITRs) also supports the notion that these forms may be highly recombinagenic. 
Of mechanistic interest in the formation of circular intermediates is the uniformity 
of mutations observed in the D-sequences and nearby p5 promoter region and the 
confinement of these mutations to the 5'-ITRs. Although the etiology of these base 
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pair changes is unknown, their uniformity suggests that they may have a direct role 
in the formation of circular intermediates and in increased stability. Recent 
findings, which suggest that an endogenous host single strand D-sequence binding 
protein is important in rAAV transduction, lend support to the potential involvement 
of this sequence in circular intermediate formation (Wang et al., 1997; Qing et al., 
1998). Furthermore, it remains to be determined whether the in vivo formation of 
AAV circular intermediates occurs through the circularization of single or double 
stranded AAV genomes. 

By analogy, retroviral transduction intermediates have striking similarities to 
the current findings with AAV. Three DNA forms have been isolated following 
retroviral infection, including linear DNA with long terminal repeats (LTRs) at both 
ends, circular DNA with one LTR, and circular DNA with multiple LTRs 
(Panganiban, 1985). Although it is disputed which of these forms are the direct 
precursor to integration, the existence of circular retroviral genomes which also 
have similar repeat regions at the ends of their genomes suggests the potential for 
common mechanisms with the formation of AAV circular intermediates. These 
AAV circular intermediates could act as integration precursors and/or stable 
episomal genomes. 

The head-to-tail ITR structures found in AAV circular intermediates are 
most characteristic of latent integrated AAV genomes. In contrast, lytic phases of 
AAV growth are typically associated with head-to-head and tail-to-tail repUcation 
form genomes. Hence, it is likely that circular intermediates represent a latent aspect 
of the AAV Hfe cycle. The finding that co-infection with adenovirus leads to 
increased abundance and stability of AAV circular intermediates suggests a novel 
link between adenoviral helper functions and latent infection of AAV. 

Aspects of inverted head-to-tail ITRs, which include palindromic hairpins 
similar in structure to "HoUiday-like" junctions, might impart recombinagenic 
activity which aids in viral integration. Such HolUday junctions have been shown to 
play critical roles in directing homologous recombination in bacteria through the 
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processing of recombination intermediates by RuvABC proteins (West, 1997; Lee et 
al., 1998). Interestingly, a mammalian endonuclease, analogous to bacterial RuvC 
resolvase, has also been isolated from cell lines (Hyde et al., 1994). Despite the 
theoretical considerations which might suggest that circular AAV genomes have 
characteristics of preintegration intermediates, a study with recombinant retrovirus 
has demonstrated that pahndromic LTR-LTR jimctions of MMLV are not efficient 
substrates for proviral integration (Lobel et al, 1989). Nonetheless, circular AAV 
genomes have been previously proposed as integration intermediates based on 
proviral structure (Linden et al., 1996). 

Example 2 

Methods 

Production of rAAV Shuttle Vector. 

The cz5-acting plasmid (pCisAV.GFPSori) used for rAAV production was 
generated by subcloning the Bspl201/Not I fragment (743 bp) of the GFP transgene 
from pEGFP-1 (Clontech) between the CMV enhancer/promoter and SV40polyAby 
blunt-end ligation. A 2.5 kb cassette containing beta-lactamase and bacterial 
replication origin from pUC19 was blunt Ugated down-stream of GFP reporter 
cassette. The ITR elements were derived from pSub201 .2 The entire plasmid 
contains a 4.7 kb AAV component flanked by a 2 kb stuffer sequence. The integrity 
of ITR sequences was confirmed by restriction analysis with Smal and PvuII, and by 
direct sequencing using a modified di-deoxy procedure which allowed for complete 
sequence through both 5 ' and 3 ' ITRs. Recombinant AAV stocks were generated by 
co-transfection of pCisAV.GFP3ori and pRep/Cap together with co-infection of 
recombinant Ad.CMVlacZ in 293 cells. The rAV.GFP3ori virus was subsequently 
purified through 3 roimds of CsCl banding as described in Duan et al, 1997. The 
typical yields from this viral preparation were 1012 DNA molecules/ml. 

DNA titers were determined by viral DNA slot blot hybridization against 
GFP *^^P-labeled probe with copy number plasmid standards. The absence of helper 
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adenoviras was confirmed by histochemical staining of rAAV infected 293 cells for 
beta-galactosidase, and no recombinant adenovirus was found in 10^^ particles of 
purified rAAV stocks. The absence of significant wtAAV contamination was 
confirmed by immunocytochemicai staining of rAAV/Ad co-infected 293 cells with 
5 anti-Rep antibodies. Transfection with pRep/Cap was used to confirm the 

specificity of immunocytochemicai staining. No immunoreactive Rep staining was 
observed in 293 cells infected with 10^^ rAAV particles. 
Isolation of AAV Circular Intermediates From Muscle. 

The tibiaUs anterior muscle of 4-5 week old C57BL/6 mice were infected 
10 with AV.GFP3ori (3 X 10^^ particles) in Hepes buffered saline (30 ^1). GFP 
|3 expression was analyzed by direct immunofluorescence of fireshly excised tissues 

and/or in formalin-fixed cryopreserved tissue sections in four independently injected 
muscles harvested at 0, 5, 10, 16, 22 and 80 days post-infection. Tissue sections 
yn were counter-stained with propidium iodide to identify nuclear DNA. Hirt DNA 

m 1 5 (Hirt, 1 967) (20 ml per muscle sample) was isolated fi'om at least three independent 

'WIS? 

muscle specimen for each time point and used to transform E, coli SURE cells using 
3 ml of Hirt with 40 ml of electrocompetent bacterial (approximately 1x10^ cfii/ug 
DNA, Strategene Inc.). The resultant total number of bacterial colonies was 
quantified for each time point and the abundance of head-to-tail circular 
20 intermediates was evaluated for each time point (> 20 bacterial clones analyzed) by 
PstI, Asel, SphI, and Pstl/Asel digestion, and confirmed by Southern blot analysis 
using ITR, GFP and stuffer probes. The head-to-tail configuration in typical clones 
were also confirmed by dideoxy sequencing using primers ELI 1 8 
(5'-CGGGGGTCGTTGGGCGGTCA-3'; SEQ ID N0:1) and EL230 
25 (5'-GGGCGGAGCCTATGGAAAA-3'; SEQ ID N0:2) which are nested to 5' and 
3' ITR sequences, respectively. Zero hour controls were generated by mixing 3 x 
10^^ particles of AV.GFP3ori with control uninfected muscle lysates prior to Hirt 
DNA preparation. As described in Table 1, a number of additional controls for were 
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performed to rule out non-specific recombination of linear AAV genomes in 
bacteria as a source for isolated circular intermediates. 



Table 1. Control Experiments for Rescue of Circular Intermediates in Bacteria 



Type of Input 
DNA 


Source of DNA 


Number of 
Molecules 


Number of 
Amp Resistant 
Bacterial 

Colonies 


Presence of 
Head-to-Tail 

Circular 
Intermediates® 


Purified rAVV 


Hirt from Infected 
Muscle (22 day) 


3 X 10^^ 


approximately 
5x10^ 


Yes 


Purified rAAV 


Virus 

rCuUilollLULCU. llllU 

Uninfected 
Muscle Hirt^ 


3 X 10^' 


0 


No 


Linear ssDNA 
Encompassing 
rAAV 
Genome^ 


Isolated from 
Purified Virus 


3 X 10'^ 


2 


No 


Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 


Isolated from 
proviral plasmid 
(Hindin/PvuII)^ 


3x10^^ 


3 


No 


Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 
+ ligase*^ 


Isolated from 
proviral plasmid 
(Hindm/PvuII) 


3 X 10^^ 


>6xl0^ 


Yes 



* Purified virus was reconstituted into muscle homogenates prior to preparation of 
Hirt DNA. 



^ Viral DNA predominantly contained single stranded genomes as evident by 
Southern blot analysis against with ITR probe. However, small amount of dsDNA 
AAV genomes also existed and are likely due to reannealing of single stranded 
genomes during preparation. Purified viral DNA concentrations were determined by 
OD260 and 75 ng representing approximately 3x10^^ viral genomes were used for 
transformation of bacteria. 
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^ Hindm/Pvnll digestion was used to remove the entire rAAV genome from 
pcisAV.GFPSori. HindHI and PvuII leave 10 and 0 bps of flanking sequence 
outside the 5' and 3' ITRs, respectively. The linear dsDNA fragment (4.7 kb) was 
gel isolated following blunting with T4 DNA polymerase and the DNA 
concentration determined by OD250. One hundred and fifty ng of linear fragment 
representing approximately 3x10^^ viral genomes were used for transformation of 
bacteria. 

^ Linear dsDNA viral genomes (HindlH/PvuII blunted fragment) were treated with 
T4 DNA ligase prior to transformation of bacteria. 

^ The presence of head-to-tail circular AAV intermediates were confirmed by 
restriction enzyme digestion (Asel, PstI, and SphI) and Southern blotting against 
ITR probe. 

Fractionation of muscle Hirt DNA preparations. 

Preparative-scale fractionation of the muscle Hirt DNA was performed by 
1% agarose gel electrophoresis using the Bio-Rad Mini Prq) Cell (Catalog 
#170-2908). A 4.5 ml (10.5 cm) tubular gel containing 1 x TBE buffer was poured 
according to manufacturer's specification. A total of 20 ml Hirt preparation from 
one entire muscle sample was loaded on top of the gel. Electrophoresis was carried 
out at a constant current of 10 mA over a period of 5 hours. Sample eluent was 
drawn from the preparative gel apparatus by a peristaltic pump at a rate of 
100 ml/min and eluted into a fraction collector at 250 ml/fraction. The collected 
DNA was subsequently concentrated by standard ethanol precipitation and used to 
transform SURE bacterial cells by electroporation as described above. 
In vitro Persistence of AAV Circular Intermediates . 

Transgene expression and persistence of AAV circular intermediate plasmid 
clones were evaluated following transient transfection in Hela and 293 cells. 
Subconfluent monolayers of Hela cells in 24-well dishes were transfected with 
0.5 mg of either AAV circular intermediates (p81 or p87) or pCMVGFP using 
Lipofectamine (Gibco BRL Inc.). The cultures were then incubated for 5 hoxjrs in 
serum free DMEM followed by incubation in DMEM supplemented with 10% fetal 
bovine serum. All plasmid DNA samples used for transfections were spiked with 
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pRSVlacZ (0.5 mg) as an internal control for transfection efficiency. At 48 hours 
post-transfection, cells were passaged at a 1 : 10 dilution and allowed to grow to 
confluency (day 5), at which time GFP clones were quantified for size and 
abundance using direct fluorescent microscopy. The percent of 
beta-galactosidase-expressing cells was also quantified at this time point by X-gal 
staining. At 5 days, cells were passaged an additional time (1:15 dilution) GFP 
clones were quantified again at day 10. The persistence of plasmid DNA at 
passage-5, 7, and 10 days post-transfection was evaluated by Southern blot analysis 
of total cellular DNA using ^^P-labeled GFP probes. To determine whether the 
head-to-tail ITR array within circular intermediates was responsible for increases in 
the persistence of GFP expression, the head-to-tail ITR DNA element was 
subcloned into tiie pGL3 luciferase plasmid to generate pGL3(ITR). The 
head-to-tail ITR DNA element was isolated from a monomer circular intermediate 
(p81) by Aatn and Haell double digestion and subsequently inserted into the Sail 
site of pGL3 (Promega) by blunt ligation. The resultant plasmid pGL3(rrR) 
contains the luciferase reporter and head-to-tail ITR element 3' to the polyA site. 
The integrity of the ITR DNA element within this plasmid was confirmed by 
sequencing. The persistence of tiransgene expression firom pGL3(ITR) was 
compared to that of pGL3 by luciferase assays on transiently ti-ansfected Hela cells 
as described above and analyzed at 10 days (passage-2). Transfection efficiencies 
were normaUzed using a dual renilla luciferase reporter vector (pRLSV40, 
Promega). 
Results 

AAV Circular Intermediates Represent Stable En isomal Forms of Viral DNA 
Associated with Long-term Persistence of Transe ene Expression in Muscle. 

To evaluate the molecular characteristics of rAAV genomes in muscle, a 
rAAV shuttle viral vector (AV.GFP3ori) was utiUzed which harbors an ampicilUn 
resistance gene, bacterial origin of repUcation, and GFP reporter gene (Figure 1 A). 
This recombinant virus was used to evaluate the presence of circular intermediates 
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by bacterial rescue of replication competent plasmids. In these studies, delivery of 
AV.GFPSori (3 X 1010 particles) to the tibialis muscle of mice led to GFP transgene 
expression which peaked at 22 days and remained stable for at least 80 days 
(Figure 4A). These results confirmed previous successes in rAAV mediated gene 
transfer to muscle (Kessler et al., 1996; Herzog et al, 1997; Xiao et al., 1996; Clark 
et al., 1997; Fisher et al., 1997). The formation of circular intermediates was 
evaluated by E. coli transformation of Hut DNA harvested from muscle at 0, 5, 10, 
16, 22, and 80 days post-infection with AV.GFPSori. 

In these muscle samples, circular intermediates were found to have a 
characteristic head-to-tail structure with 1-2 TTR repeats. The most abundant form 
included two inverted ITRs within a circularized genome (Figure 4B, clone pi 7). 
This figure also depicts a less frequent form (< 5%) of circular intermediates 
observed, p439, with undetermined structure. When this type of replication 
competent plasmid was seen, it was not included in the quantification of 
head-to-tail circular intermediates since its structure could not be conclusively 
determined. The total abundance of muscle Hirt derived head-to-tail cfrcular 
intermediates (with 1-2 ITRs) demonstrated a time-dependent increase Ihat peaked 
with transgene expression at 22 days and slightly decreased by day 80 (Figure 5A). 
Increased diversity in the length of ITR arrays within circular intermediates was seen 
at longer time points. For example. Figure 5B demonsfrates several isolated circular 
mtemediates with 1-3 ITRs isolated from 80 days muscle Hirt samples. This is in 
contrast to tiie more uniform structure of circular intermediates with two ITRs in a 
head-to-tail conformation at 5-22 days post-infection. 

To evaluate the potential for artifactual rescue of linear rAAV genomes by 
recombination in bacteria, several control experiments were performed. First, 
uninfected control muscle Hirt preparations, spiked with an equal amount of rAAV 
virus used for in vivo infection of muscles, failed to give rise to replicating plasmids 
following transformation of E. coli. Second, when a blunted linear double stranded 
HmdHI/PvuII fragment isolated from pcisAV.GFP3ori (encompassing the entire 
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rAAV genome) was used to tonsform bacteria, no ampicillin resistant bacterial 
colonies were obtained. The addition of T4 ligase to this fragment, however, led to 
significant numbers of bacterial colonies. Third, when purified single stranded 
rAAV DNA was used for transformation, no bacterial colonies were obtained. As 
summarized in Table 1, these results confirm that in the absence of productive 
infection, rAAV genomes themselves are incapable of recombining into repUcation 
competent plasmids in bacteria. Hence, in vivo circularization of rAAV genomes is 
a prerequisite for rescuing autonomously rephcating plasmids in E. coli with this 
shuttle vector. 

Molecular weight of circular intermediates sugges t a conversion from monomer to 

multimer forms over time. 

To ftirther characterize the circular intermediates isolated from miiscles, Hirt 
samples from 22 days and 80 days post-infected muscles were size fractionated by 
continuous-flow gel electrophoresis (BioRad). As shown in Figure 6, the majority 
of circular intermediates at 22 days post-infection size fractionated at a molecular 
weight of less than 3 Kbp. Very few clones were isolated from fractions between 
3 to 5 kb and no clones were obtained from fractions larger than 5 kb at this time 
point. Furthermore, this size fractionated molecular weight of in vivo Hirt derived 
circular intermediates at 22 day time points correlated with that of head-to-tail 
monomer undigested cfrcular intermediate plasmids rescued in bacteria from this 
same time point (approximately 2.5 kb). These data suggest that at early time points 
post-infection in muscle, the predominant form of circular intermediates likely 
occurs as monomer genomes. The lower mobiUty of this fraction as compared to 
repUcation form monomer (Rfrn=4.7 kb) and dimer (Rfd=9.4 kb) genomes provides 
indirect evidence that these forms are not responsible for rescued plasmids in these 
Hirt samples. Interestingly, when 80 day muscle Hirt samples were size 
fractionated, more clones ware retrieved from higher molecular weight fractions 
ranging from 3-12 kb (Figure 6). This shift in the molecular weight of circular 
intermediates indicates the potential for recombination between monomer forms in 
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the generation of large circular multimer genomes. Such concatamerization has 
been previously observed in muscle and has traditionally been hypothesized to 
involve linear integrated forms of the AAV genome (Herzog et al., 1997; Xiao et al., 
1996; Clark et al, 1997; Fisher et al., 1997). This data sheds new hght on the 
molecular characteristics of these persistent AAV genomes and suggests that they 
are in fact circular and episomal. Based on yields of retrievable circular plasmids 
reconstituted in Hirt DNA, the efficiency of bacterial transformation, and the initial 
innoculum of virus, we estimate that approximately 1 in 400 viral DNA particles 
circularize following infection in muscle (Table 2), 



Table 2. Yield of Circular Intermediate Isolation from Hirt DNA 



Bacterial 
Transformation 


Starting Number 
of Plasmid or 
AAV Genomes 


Actual Number 
of Amp^ cfii 


Adjusted Yield 


Hirt DNA from r AA V 
Infected Muscle ^ 


3x 10^^ molecules 


5x10^ cfii 


5x10^ cfii^ 


Hirt DNA + 230 ng 
LacZPlasmid 


3x 10^^ molecules 


2x 10^ cfii^ 


2x10^ cfii 


230 ng LacZ Plasmid 


3x10^^ molecules 


2x 10^ cfu 





^ The actual amount of Hirt used for transformation was 3/20 the entire Hirt DNA, 
The niunbers have been adjusted to reflect viral innoculum and yields for the entire 
muscle. 



^ Plasmid DNA was spiked into mock infected muscle homogenates prior to 
isolation of Hirt DNA. This reconstituted Hirt DNA was then used for 
transformation of bacteria. 

*^ The actual microgram amounts of plasmid used in reconstitution experiments was 
10 ng. The numbers have been adjusted for comparison to normahze the number of 
plasmids genomes to that used in AAV experiments. Control LacZ plasmid was 
approximately 7000 bp with a molecular weight of 4.6 x 10^ g/mole. 

^ The average of several experiments indicates an approximate 100-fold reduction 
in the number of cfu recovered from bacterial transformations with DNA isolated 
from Hirt extract spiked with plasmids as compared to transformation with an 
equivalent amount of plasmid DNA alone. 
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^ Adjusted yield indicate approximately 1 in 400 AAV genomes circularize in vivo. 

Given the fact that not all rAAV particles likely contain functional DNA 
molecules and intermediates may integrate, these calculations may represent an 
underestimation. 

AAV Circular Intermediates Demonstrate Increased Persistence as Plasmid Based 
Vectors. 

Based on the finding that circular AAV intermediates were associated with 
long term persistence of transgene expression in muscle, rAAV circular head-to-tail 
intermediates may be molecular structures of the AAV genome associated with the 
latent Ufe cycle and increased episomal stability. Several aspects of the structure of 
AAV circular intermediates may account for their increased stability in vivo. First, 
circularization of AAV genomes may create a nuclease resistant conformation. 
Secondly, since the only viral sequences contained within circular intermediates are 
the head-to-tail ITR array, these sequences might bind cellular factors capable of 
stabilizing these structures in vivo. Several studies have demonstrated increased 
persistence of transgene expression with plasmid DNA encoding viral ITRs (Philip 
et al, 1994; Vieweg et al., 1995). The results described above provide a functional 
explanation for the increased persistence through the association with circular 
intermediate formation as part of the AAV life cycle. 

To more closely evaluate the persistence of AAV head-to-tail circular 
intermediates, several in vitro experiments were performed by transfecting these 
intermediates into Hela cells and assessing the stability of plasmid DNA and 
transgene expression by GFP clonal expansion. Results from Hela cell transfection 
experiments demonstrated that two monomer head-to-tail circular intermediates 
(p81 and p87) studied gave rise to a 10-fold higher number of five and ten day 
transgene-expressing clones, as compared to a control pCMVGFP plasmid lacking 
the ITR sequences (Figures 7A and B). Additionally, the size of GFP positive 
colonies at 5 days post-transfection was three-fold larger in Hela cells transfected 
with p81 and p87, as compared to the pCMVGFP control vector (Figures 7 A and B). 
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These studies suggest the AAV circular intermediates have increased stability of 
transgene expression and substantiate findings in muscle. 

To confirm the increased molecular persistence of head-to-tail circular 
intermediates following transfection into Hela cells, total DNA (low and high 
molecular weight) was isolated firom cultures of pCMVGFP and p81 transfected 
Hela cells at various passages post-transfection and analyzed by Southern blotting. 
Southem blots hybridized to ^^P-labeled GFP probes demonstrated a significantly 
higher level of p81 plasmid DNA at passage-7 as compared to the control vector 
lacking the head-to-tail ITR sequence (Figure 7C). The majority of signal in 
undigested DNA samples was associated with a 4.7 kb band migrating at the 
approximate size of the uncut monomer plasmids. Together with the fact that the 
majority of signal firom all cell cultures in Figure 7C disappeared by passage- 10, 
these data suggest that these plasmids predominantly remained episomal. Thus, in 
both muscle and Hela cells, increased persistence of AAV circular intermediates is 
correlated with stable transgene expression. 
ITR arrays are responsible for increased persistence. 

To investigate whether the head-to-tail ITR DNA element was responsible 
for the increased persistence of circular intermediates, we cloned this DNA element 
into a secondary luciferase vector (pGL3) to give rise to pGL3(ITR). Transient 
transfection experiments in Hela cells demonstrated a five-fold increase in the 
persistence of luciferase expression in serially-passaged cultures at 10 days in 
pGL3(ITR) as compared to that of pGL3 transfected (Figure 7D). These findings 
support the hypothesis that the head-to-tail ITR DNA element contained within 
circular intermediates is responsible for mediating the increased persistence of 
transgene expression and suggest a mechanism by which these molecular 
intermediates may confer stabiHty to AAV genomes in vivo. Furthermore, increases 
in the stability of transgene expression conferred by this element appear to be 
primarily context independent, since the head-to-tail ITR element was 3' to the 
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luciferase gene in pGL3(ITR) and 5' to the GFP transgene in AAV circular 

intermediates. 

Discussion 

Characterization of integrated proviral structures in different cell lines has 
demonstrated head-to-tail genomes as the predominant structural forms for both 
wild type and recombinant AAV (McLaughlin et al., 1988; Cheung et al., 1980; 
Duan et al., 1997). This is in contrast to the head-to-head and tail-to-tail structures 
obsCTved in AAV replication intermediates (Rfin and Rfd). Both Rfin and Rfd 
configurations have also been demonstrated in rAAV infected cells and enhanced 
conversion of ssAAV genomes to double stranded Rfin and Rfd forms has been 
suggested as a mechanism for augmentation of rAAV transduction by adenovirus in 
cell lines (Ferrari et al., 1996; Fisher et al, 1996). However, it is plausible that the 
mechanisms responsible for the formation of Rfin and Rfd molecules are different 
from pathways which lead to long-term transgene expression, hi support of this 
hypothesis is a recent study evaluating augmentation of rAAV fransgene expression 
by adenovirus in liver (Snyder et al., 1997). These studies have demonstrated that 
co-infection of the liver with adenovirus and rAAV enhances short term transgene 
expression while long term expression was no different than rAAV alone. The exact 
mechanism for the formation of head-to-tail circular intermediates is not clear, 
however similar structures have been demonstrated to act as pre-mtegration 
intermediates for retrovirus (Varmus, 1982). In this regard, circularized retroviral 
genomes with one and two viral LTRs have been proposed. In addition, circular 
pre-integration uitermediates have also been suggested by recent studies on wtAAV 
integration (Linden et al., 1996b). The demonstration that circular intermediates 
exist in rAAV infected muscle explains several features of latent phase infection 
with rAAV vectors including proviral structure and stable episomal persistence. 

Previous studies have suggested that rAAV genomes delivered to muscle 
might persist as head-to-tail concataners (Herzog et al., 1997; Clark et al., 1997; 
Fisher et al., 1997). However, it is currently unknown whether these concatamers 
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exist as free episomes or as integrated proviruses in the host genome. The results 
described above, i.e., demonstrating prolonged persistence of head-to-tail circular 
intermediates at 80 days post-infection, suggest that a large percentage of rAAV 
genomes may remain episomal. The conversion of monomer circularized genomes 
to larger circularized multimers appears to be an aspect associated with long term 
persistence and likely represents recombinational events between monomer 
intermediates. Although the bacterial rescue strategy was not capable of 
satisfactorily addressing the size of multimers, our modified approach to size 
fractionating Hirt DNA prior to bacterial rescue of intermediates lends support to 
this hypothesis. Additional supportive evidence for increased recombination over 
time is the finding that greater variability in the length of ITR arrays was observed at 
longer time points post-infection. For example, at 5-22 days the majority of circular 
intermediates contained 2 ITRs in a head-to-tail fashion. This is in contrast to 80 
day time points where the lengths of ITR arrays ranges from 1-3 ITRs. Such 
diversity of ITR arrays in muscle infected with AAV has been previously found 
using PGR approaches (Herzog et al, 1997; Fisher et al., 1997). In addition, the 
30% dechne in the abundance of circular intermediates in muscle between 22 and 80 
days also supports a hypothesis that these molecular forms of AAV may represent 
pre-integration complexes. 

Given the fact that circular intermediates had long term persistence in 
muscle, certain structural features of these intermediates may affect episomal 
stabiHty of DNA. Previous studies have noted increased persistence of transgene 
expression from plasmids encoding AAV ITRs (Philip et al., 1994; Vieweg et al, 
1995). However, the physiologic significance of this finding has remained elusive. 
The present study, demonstrating the head-to-tail ITR arrays isolated from AAV 
circular intermediates can confer increased episomal persistence to plasmids 
following transfection in cell lines, gives a mechanistic framework for ITR effects 
on plasmid persistence. Furthermore, the correlation that AAV circular 
intermediates have increased persistence in cell lines in vitro^ lends support to the 
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hypothesis that these structures represent stable episomal forms following rAAV 
transduction in muscle. Stability of circular intermediates in vivo might be mediated 
by the binding of cellular factors to "HoUiday-like" junctions in ITR arrays which 
stabiUze or protect DNA from degradation. 

rAAV has been shown to be an efficient vector for expressing transgenes in 
various tissues in addition to muscle, such as brain, retina, liver, lung, and 
hematopoetic cells (Snyder et al., 1997; Muzyczka, 1992; Kaphtt et al., 1994; Walsh 
et al., 1994; Halbert et al., 1997; Koeberl et al., 1997; Conrad et al., 1996; Bennett et 
al., 1997; Flannery et al, 1997). Despite these advances in the application of rAAV, 
the mechanisms of in vivo rAAV-mediated transduction and persistence of 
toansgaie expression still remain unclear. Such questions as to the molecular state 
of rAAV following in vivo dehvery is highly relevant to the clinical ^plication of 
this viral vector. For example, should rAAV primarily persist as an randomly 
integrated provirus, the potential for insertional mutagenesis could present a major 
Aeoretical obstacle in the use of this vector due to the potential for mutational 
oncogenesis. The demonstration that rAAV can persist as episomes suggest that 
random integration and associated risks of malignancy may not be a major concern 
for this viral vector s>«tem. Additionally, the molecular determinants of AAV 
chcular intermediates associated with increased persistence m cell lines appear to be 
contained within the DNA elements encompassing the inverted ETRs. The isolation 
of this naturally occurring viral DNA element, which forms as part of the AAV life 
cycle and acts to stabilize circular episomal DNA, may prove usefiil in increasing 
the efficacy of both viral and non-viral gene therapy vectors. 

Example 3 

Evidence for Increased Episomal Persistence of AAV Circ ular Intermediates in a 
Model for in utero Plasmid-Based Gene Therapv 

Persistence of AAV circular intermediates were assessed by injection of 
plasmid DNA directly into the pronucleus of fertiUzed Xenopiis oocytes. Twenty- 
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five ng of the p81 isolate of AAV circular intermediates was injected at the single 
cell stage of fertilized Jfewopw^ oocytes. This plasmid was compared to the proviral 
plasmid pCisAV.GFPSori, which contains two ITRs separated by stuffer sequence in 
an alternative confirmation to ITRs in p81 . Figure 13 depicts the persistence of GFP 
5 plasmids as assessed by direct fluorescence of GFP. At this state of tadpole 

development, the fertilized oocyte has expanded fi-om a single cell to approximately 
10^ cells. 

These studies confirm that AAV circular intermediates (p81) confer a higher 
level of stability in development XewcpM^' oocytes than plasmids containing similar 
10 transcriptional elements and ITR sequences in an alternative confirmation. Given 
that in the case of p81 injected oocytes, tadpoles are completely fluorescent, the data 
suggests that some level of integration may have occurred. 
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it Example 4 

15 Liposome Mediated Transfer of Vectors of the Invention to the Airway and Muscle 
Studies evaluating the mechanisms of recombinant adeno-associated virus 
(AAV) transduction have identified a novel molecular intermediate responsible for 
episomal persistence. This intermediate is characterized by a circularized AAV 
genome with head-to-tail ITR repeats. Circular intermediates of rAAV were 
20 identified using a recombinant shuttle vector capable of propagating circularized 
viral genomes in bacteria. Pivotal experiments in cell lines demonstrate that the 
formation and persistence of these circular intermediates are augmented in the 
presence of helper adenovirus. These findings suggest that cellular factors induced 
by adenoviral gene expression may modulate both the formation and/or persistence 
25 of AAV circular intermediates. Furthermore, studies in muscle have demonstrated 
that following rAAV infection, the formation and persistence of AAV circular 
intermediates correlates with the onset and maintenance (at 80 days) of transgene 
expression, respectively. Moreover, a 300 bp firagment encompassing the head-to- 
tail inverted ITR repeats found in AAV circular intermediates when cloned into 
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heterologous expression plasmids can confer increased stability to those plasmids in 
HeLa cells. The structural aspects of AAV circular intermediates may lead to 
development of non-viral, plasmid based, gene transfer vectors with increased 
persistence of transgene expression. 

To determine whether AAV circular intermediates which differ in length 
and/or sequence of the ITR array are more efficacious plasmid based vectors for 
liposome-mediated gene transfer to the airway and muscle, several distinct forms of 
AAV circular intermediates are evaluated as plasmid-based delivery systems in three 
model systems of the airway including: 1) in vitro polarized primary airway 
epitheUal monolayers, 2) mouse lung, and 3) human bronchial xenografts. 
Persistence is evaluated at both the level of transgene expression (using GFP and 
lucif erase reporters) and at the level of episomal and integrated transgene derived 
DNA. Studies are performed to assess whether integration can be specifically 
enhanced by co-transfection with Rep DNA or mRNA. These studies also evaluate 
both the extent of integration and site specificity to AVSl sites in chromosome 19 of 
human model systems. 

Gene therapy using plasmid-based delivery systems have encountered 
several obstacles to efficient transgene expression. These obstacles include transient 
expression of transgenes and rapid degradation of DNA. In contrast, viruses have 
developed efficient mechanisms for transducing cells and expressing encoded viral 
genes. The molecular characteristics of AAV circular intermediates which confer 
increased persistence of transgene expression include a DNA element encompassing 
the head-to-tail ITR. Based on the findings that circular intermediates have increased 
episomal persistence in muscle following rAAV transduction, these structures may 
also have increased persistence as plasmid-based vehicles to the airway. 
Interestingly, several naturally occurring mutations which are found in 
approximately 50% of AAV circular intermediates affect the stability of the 
intermediate. 
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Several findings evaluating the efficiency of AAV circular intermediate 
formation fi*om recombinant viral vectors have suggested that these structures are 
augmented in abundance by the presence of the E2a adenoviral gene product. These 
molecular structures may represent preintegration intermediates which, in the case 
5 of wild-type AAV, would efficiently integrate into the cellular genome by Rep 
facilitated mechanisms. However, in the case of recombinant AAV genomes (in the 
absence of Rep proteins), evidence suggests that these structures have increased 
episomal stability. To test whether exogenous addition of Rep and/or E2a can 
increase the efficacy of AAV circular intermediates by modulating their stability 
10 and/or integration, co-transfection methods with Rep encoding plasmids and mRNA 
53 are conducted. Additionally, exogenously suppUed E2a DNA binding protein 

(DBP) may also enhance stability of AAV circular intermediates. Rep may increase 
the integration of circular intermediates while E2a may increase their episomal 
stability. Several observations including the association of E2a DBP with AAV 
13 1 5 genomes in the nucleus support a direct interaction between DBP and AAV circular 

intermediates. Furthermore, if DBP associates with AAV circular intermediates, its 
encoded nuclear localization sequence (NLS) may enhance nuclear sequestration of 
[J these plasmids in the nucleus. Alternatively, E2a may act to alter the persistence of 

AAV circular intermediates through the induction of cellular factors which interact 
20 with the ITR array. 

Liposome mediated gene transfer to the airway has considerable advantages 
due to the low level of toxicity. However, Umitations include transient low level 
expression in differentiated airway epithelia. Despite this apparent limitation, 
several laboratories have had considerable success with the use of cationic 
25 liposome-mediated gene transfer in several animal models including mouse and rat 
lung, and numerous laboratories have pursued clinical trials, which suggested that 
these vehicles may show promise for gene therapy of the cystic fibrosis (CF) lung. 
Thus, delivery of the present vectors in plasmid form via liposomes maybe a safe 
and effective vehicle for gene transfer to the airway. 
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To assess whether AAV circular intermediates may also have increased 
persistence in airway epithelial cells as seen in Hela cells, several distinct forms of 
circular intermediates delivered by liposome-mediated transfection into primary 
airway epithelial cells, are evaluated. Based on the diversity of ITR repeat elements 
between various isolated circular intermediates (i.e., including 0, 1, 2, and 3 ITRs), 
circular intermediates isolated from later time points in muscle may have been 
naturally selected for increased stabihty in vivo. Hence, the structural consistencies 
between AAV circular intermediates are identified which give mcreased persistence 
as plasmid based vectors for gene transfer. 

Circular intermediates containing the GFP reporter gene and 1, 2, and 3 ITRs 
are transfected into primary airway cultures and polarized epitheUal cell monolayers 
usmg the cationic Upid GL-67 (Genzyme Inc.). DNA to lipid ratios are optimized 
using a luciferase reporter. Additionally, the addition of EGTA, or the use of 
calcium-free media, can increase the extent of gene transfer about 10-fold, and may 
be included to enhance gene transfer to polarized epithelial monolayers. To evaluate 
persistence and expression of transgenes from circular intermediates, direct 
fluorescent microscopy and Southern blotting of both Hirt and genomic DNA with 
GFP P^^-labeled probes are utilized. ProUferating cultures of primary airway 
epithelial cells can be passaged up to 4 times during this analysis. In contrast, 
polarized epithelial monolayers are evaluated at 1 week intervals for DNA 
persistence for up to 6 weeks. Since GFP transgene expression may be low and 
difficult to detect by direct fluorescence, GFP is quantitated by fluorometer of ceil 
lysates. 

Following AAV transduction, circular intermediates may form within cells 
and certain structures of these intermediates may persist by virtue of affinity for 
cellular factors which bind at ITR arrays. If this is true, then it may be possible to 
select for and isolate optimal circular intermediates with increased persistence in 
airway cells by batch screening of circular intermediates pools from rAAV infected 
airway epithelia. 
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Primary airway epithelia cell cultures are infected with AV. GFP3ori (MOIs 
of 1000 to 10,000 DNA part/cell) aad low molecular Hirt DNA is prepared at 5-15 
days post-infection. Hirt DNA containing circular intermediates from rAAV 
infected cells is used to then transfect primary airway epithelial cells from which 
Hirt DNA is prepared at 5-1 5 days post-transfection. This second Hirt isolation is 
then used to isolate repUcation competent plasmids following transformation into 
bacteria. This selection process may give rise to those populations of circular 
intermediates with increased episomal persistence in airway epithelial cells. 
Selected clones of circular intermediate plasmids isolated by this procedure are then 
tested individually for increased persistence following liposome mediated 
transfection. These studies are performed in a batch type screening in 24 well plates 
using two serial passages for persistence. Once plasmids having increased 
persistence are isolated, their structure and sequence of ITR arrays are characterized. 
Since screening is performed on small-scale cultures, it may be necessary to 
implement semi-quantitative screening for DNA persistence within the first round of 
transfection using PGR methods. Candidate plasmids with a high level of increased 
persistence as compared to control plasmids which lack ITR sequences but contain 
the identical promoter-reporter element, are evaluated on a larger scale transfection 
amenable to analysis by Southern blotting of total DNA. 

To evaluate selected circular intermediate structures in vivo, two models 
including mouse lung and the human bronchial xenograft are employed. 10 wk 
BalbC mice are transfected with GL-67/DNA complexes at a ratio of 25 |ig 
plasmid/25 |xg lipid in an iso-osmotic solution of Dextrose. At 1, 5, 10, 15, and 
20 days post-transfection lungs of mice are harvested for immunofluorescent 
detection of GFP in formaUn fixed sections and for quantitative fluorometry of 
tissue lysates. Southern blots are employed to evaluate the persistence of plasmids 
in Hirt and genomic DNA. In addition to evaluating the persistence of selected 
circular intermediates which have the highest level of persistence with in vitro 
models, luciferase constructs are evaluated in which the ITR array has been cloned 
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either 5' or 3 ' to the reporter gene. Furthermore, the use of luciferase reporters 
allows for more sensitive assessment of transgene activity in cell lysates. 

Similarly, in vivo persistence of transfected circular intermediates and 
heterologous plasmids containing ITR arrays found within circular intermediates is 
evaluated in human bronchial xenografts. 

Findings evaluating the effects of adenoviral co-infection on circular 
intermediate formation and persistence have suggested that E2a DBF leads to a 10- 
fold increase in the abundance of circular intermediates as compared to E2 deleted 
virus. Furthermore, studies with El-deleted virus have demonstrated that the 
persistence of circular intermediates in Hela cells is increased at 72 hours post- 
infection. These studies suggest that E2a DBF may augment circular intermediate 
formation and/or increase the stability of these structures by an unknown 
mechanism. E2a DBF may interact directly with circularized genomes and/or 
induce cellular factors which interact with sequences in these AAV genomes. Since 
DBF encodes an NLS, this protein may act to shuttle circular intermediates to 
regions of nucleus that allow for increased stabihty of these structures. NLS 
sequences have been shown to cooperatively interact with nucleolar targeting 
sequences and hence we will also evaluate if subnuclear targeting is important in 
maintaining the increased stability of circular intermediates containing ITR arrays. 
Fxirthermore, it is currently unknown where circular intermediates form in the cell 
and it remains plausible that they may form in the cytoplasm or nucleus. Hence, if 
DBF associates directly with circular intermediates, it may act as an NLS for DNA 
to enter the nucleus as well. 

Several in vitro reconstitution models are used to investigate the interaction 
of circular intermediates with DBF and their affect on in vivo persistence following 
DNA transfection in Hela cells. Furthermore, results evaluating the affects of 
various mutant adenoviral vectors on circular intermediate and Rfm/Rfd formation 
have suggested ttiat these two types of intermediates occur by independent pathways 
indicative of latent and lytic infection, respectively. Li the setting of wild type 
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AAV, circular intermediates may be pre-integration complexes, which in the 
presence of Rep, eiSciently integrate into the host genome. In contrast, in the 
absence of Rep, circular mtermediates may accumulate episomally in rAAV infected 
cells. To this end, methods of supplementing Rep function may be capable of 
enhancing integration of plasmid based delivery of AAV circular intermediates. 
Thus, experiments in which co-transfection of circular intermediate plasmids with 
Rep expression plasmids or mRNA are conducted. 

To investigate whether DBF can augment the stability of circular 
intermediates by increasing targeting to the nucleus, a Hela cell line (gmDBP6) is 
utihzed which encodes an inducible E2a gene under a dexamethasone responsible 
element. This cell line gives rise to high levels of DBF in nuclear extracts by 
Western blot following treatment with dexamethasone. gmDBF6 cells (+ /- DEX) 
are transfected with various AAV circular intermediate plasmids containing 0, 1, 2, 
and 3 ITRs and total cellular and nuclear plasmid content evaluated by subcellular 
fractionation using Southem blotting against GFP probes. The time course of these 
studies is initially within the range of 12 hours to 4 days post-transfection. 
Transgene expression is evaluated by fluorometry (in cell lysates), and fluorescent 
microscopy (in viable cells), for GFP and luminescence for luciferase. Hela cells 
have demonstrated that immediate increases in transgene expression from AAV 
GFP circular intermediates as compared to control GFP plasmids occur as early as 
24 hours post-transfection. Thus, certain cellular factors may facilitate an 
immediate accumulation of circular intermediates in the nucleus. DBF may invoke 
this increase by either direct interactions with ITR sequences or by the induction of 
cellule factors. To evaluate the potential for direct interactions between DBF and 
circular intermediates, various form of ITR arrays found within circular 
intermediates are end-labeled with y-ATF^^ and evaluated for binding by 
electrophoretic mobility shift assays to nuclear extracts from gmDBP6 cells (+/- 
DEX). Supershifts, with DBF antibodies and competition experiments with cold 
ITR sequences and non-specific DNA, are used as controls for specific bindiug. 
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In a second model system aimed at evaluating the potential of DBP for 
shuttling and/or sequestering of circular intermediates to the nucleus, microinjection 
experiments in oocytes are performed with 50 ng of plasmid DNA of circular 
intermediates with and without 50 ng of DBP mRNA. Experiments initially 
evaluate the time course of GFP transgene expression (+/- DBP cRNA) by direct 
fluorescent microscopy. If major differences are seen, quantitative fluorometry of 
individual whole oocytes in 96 well plates is conducted. Similar studies on nuclear 
targeting in the presence of DBP can also be evaluated in this model by pooling 
microinjected oocytes for nuclear isolation and Southern blot analysis. 

A third experimental model to evaluate nuclear targeting and/or 
accumulation of circular intermediate vectors in the presence and absence of DBP 
involves the microinjection of fluorescently labeled plasmid DNA into the 
C3rtoplasm and real time imaging to follow the nuclear accumulation of DNA. The 
DNA fluorescent dye, TOTO-1, is used to label DNA prior to injection. This dye 
forms an extremely stable complex with negligible diffusion and re-incorporation 
into nuclear DNA following transfection into polarized airway epithelial cell 
monolayers. Co-localization of DBP with vrtAAV DNA genomes at focal hot spots 
within the nucleus supports the observation that nucleolar targeting may be 
important for persistence. These experiments are also performed in primary airway 
epithelial cells and in vivo models of the airway by either co-transfection of circular 
intermediates with DBP expressing plasmids and/or mRNA. 

The effects of Rep co-transfection on the integration of circular intermediate 
plasmids is also evaluated. Two methods are used to express Rep including: 1) co- 
transfection with Rep expressing plasmids, md 2) co-transfection with Rep 
encoding mRNA. Initially, Hela, CFTl, and IB-3 cells are tested, as transformed 
cells may be more amenable to expansion and evaluation of integration. Both CFTl 
and IB-3 cells represent airway epithelial cells. Experiments are performed by 
cationic liposome (GL-67) mediated transfection of circular intermediate DNA with 
varying doses of a Rep-containing expression vector, e.g., pCMVRep. The extent of 
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integration is also evaluated by two criteria. Southern blotting of Hirt and genomic 
DNA and clonal expansion of GFP expressing cells. Since Southern blot has an 
approximate limit of sensitivity of 1 integrated plasmid molecule per 10 cellular 
genomes, clonal expansion may be necessary to evaluate persistence in less 
transfectable cells such as CFTl and IB-3 cells. Cell lines are evaluated over the 
course of 1-10 passages. 

Sustained expression of Rep by plasmid mediated co-transfection may be 
toxic to cells, hence co-transfection with Rep mRNA is also evaluated. Cationic 
Uposome:mRNA mediated transfection has been previously shown to work in cell 
Hues and although the level of expression is much more transient than for DNA, in 
these studies it may be an advantage. Initial studies are performed with in vitro 
transcribed Rep mRNA alone to evaluate the |xg amount of mRNA needed for Rep 
expression as determined by Western blot. Once the threshold for detectable Rep 
expression is estabUshed, increasing amounts of Rep mRNA are co-transfected with 
circular intermediate DNA. Similar assays are used as described above to evaluate 
the extent of AAV circular intermediate integration. If fmdings suggest that 
increased integration if facilitated by Rep, the site specificity of this integration can 
be evaluated by cloning GFP expressing cells after the 10th passage by serial 
dilution. These GFP expressing clones are expanded and genomic Southern blots 
assessed with both GFP and AVSl specific probes. By evaluating a number of 
restriction enzymes which either do not cut or cut once within the circular 
intermediate plasmid, it will be determined whether integration has occurred at the 
AVSl loci. 

To test whether secondary structure rather than primary sequence is the 
important determinant of increased episomal stability of AAV circxxlar 
intermediates, synthetic DNA sequences are generated with identical secondary 
structure to several ITR arrays in circular intermediates. The primary sequence is 
completely altered and bares no resemblance to sequences contained within native 
AAV ITRs. These synthetic DNA sequences are tested for their ability to confer 
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increased episomal stability to heterologous plasmids in several model systems 
including: l)lJieairway, 2) muscle, 3) and developing Xewcpw^ embryos. The 
devGloprng Xenopus embryo model is ideal for testing integration and persistence of 
plasmid based vectors for application of in utero gene therapy. If synthetic DNA 
sequences with similar secondary structure to ITRs are found to confer increased 
persistence to plasmid based vectors, then determinants for protein binding which 
facilitate persistence are independent of primary base sequence. These studies allow 
the optimization of the secondary structural requirements by synthesizing a wide 
range of DNA molecules with varying degrees of palindromic repeats. Furthermore, 
the secondary structure may not bind proteins directly but facilitate recombination of 
plasmids to large concatamers which have increased episomal stability or enhanced 
integration efficiencies. 



Example 5 

Delivery of Multiple Genes through Intermolecular Concatamerization 
Methods 

Recombinant AAV vectors . 

Two rAAV vector stocks were generated for use in these studies, 
AV.GFPSori (Example 1) and AV.Alkphos (also known as CWRAPSP, a gift of 
Dusty Miller) (Halbert et al., 1997). Virus stocks were generated by co-transfection 
of 293 cells with either pCisAV.GFP3ori or pCWRAPSP along with pRep/Cap, 
followed by co-infection with recombinant Ad.CMVlacZ helper virus (Example 2). 
rAAV was then purified through three rounds of CsCl density gradient 
centrifizgation as previously described by Duan et al. (1997). Purified viral fi-actions 
were heated at 60^C for 1 hour to inactivate any residual contaminating helper 
adenovirus. The yields for AV.GFP3ori and AV.Alkphos were 1 x 10^^ and 7 x 10" 
particles per ml, respectively, as determined by slot blot hybridization with ^^P- 
labeled GFP or Alkphos probes. Infectious titers determined by infection of 293 
cells with rAAVs were 1.1x10^ lU/ml (AV.GFPSori) or 8.6 x lO^IU/ml 



74 



T 



8 - 



if?: ; 



(AVAlkphos). Controls testing for contamination of rAAV stocks with wtAAV by 
anti-Rep immunocytochemical staining in rAAV/Ad.CMVlacZ co-infected 293 cells 
were negative (limit of sensitivity is less than 1 infectious wtAAV particle per 10^^ 
DNA particles of rAAV). Similarly, histochemical staining for p-galactosidase in 
5 rAAV infected 293 cells showed no detectable contamination with helper 
adenovirus in 10^^ DNA particles of rAAV (limit of sensitivity). 
Infection of muscle tissue and evaluation of transgene expression. 

The C57BL/6 mice used for these experiments were housed in a virus-free 
animal care facihty and were maintained under strict University of Iowa and NIH 
1 0 guidelines, using a protocol approved by the Animal Care and Use Committee and 
faciUty veterinarians. Four to five week old mice received bilateral 30 |il injections 
of a mixture of both AV.GFPSori and AV Alkphos into the tibialis anterior muscle 
h (5x10^ DNA particles of each virus per nmscle). Controls included unhyected 

muscles and muscles receiving injections of one of the viruses alone. At 14, 35, 80, 
15 and 120 days post-infection, animals were euthanized and tissues were harvested for 
evaluation of transgene expression and preparation of low molecular weight Hirt 
DNA. For each experimental time point, at least 3 independently injected muscles 
were evaluated. 

Li all experiments, GFP fluorescence was visualized in freshly excised 
20 muscle tissue prior to processing. A portion of the same muscle was fixed with 2% 
paraformaldehyde in phosphate buffered saline, and cryoprotected in graded sucrose 
solutions before embedding in optimal cutting temperature medium (OCT). 
Sections (6 |am) were then evaluated for GFP expression directly and Alkphos 
expression following heat inactivation of endogenous Alkphos and histochemical 
25 staining for Alkphos activity (Engelhardt et al., 1995). To confirm dual localization 
of GFP and Alkphos expression in the same muscle fibers, either serial sections 
were evaluated for GFP and Allqphos expression or the same section was furst 
photographed for GFP expression followed by histochemical staining for Alkphos 
and re-imaging of the same field. 
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Rescue of circular intermediates from muscle Hirt DNA > 

Low molecular weight Hirt DNA was prepared from 20 mg specimens of 
injected muscles from 3 animals at each time point (Example 2). Hirt DNA (4 |il; 
1/5 of the total volume) was then used to transform 50 |il of electrocompetent SURE 
cells (Stratagene) using a BioRad E, coli electroporater and 0.1 |xm cuvettes. 
Colonies resulting from each bacterial transformation were quantified, and plasmids 
from 20 colonies from each muscle Hirt DNA sample were purified for analysis. It 
should be noted that only circular forms carrying the Amp resistance gene and the 
bacterial origin of repHcation from AV.GFP3ori are rescued by bacterial 
transformation (Duan et al., 1998). Control experiments reconstituting 5 x 10^** viral 
DNA particles into uninfected muscle extracts prior to Hirt DNA preparation failed 
to give rise to replication competent plasmids in the rescue assay (Duan et al., 1998). 
Additional controls in Duan et al. (1998) using AV.GFPSori virus also demonstrated 
that linear double stranded and single stranded purified viral DNA genomes do not 
give rise to replication competent plasmids following transformation into E colL 
Characterization of encoded genes in rescued circular intermediates . 

Several assays were used to characterize the extent of intermolecular 
recombination between independent circular viral genomes by evaluating the 
number and type of encoded genes in rescued plasmids from Hirt DNA of muscles 
co-infected wilh AV.GFPBori and AV.Allq)hos. Initial analysis involved the bulk 
evaluation of 60 rescued plasmids (20 from each of three muscle samples for each 
time point) by dot blot hybridization of mini-prep DNA with EGFP, Alkphos, and 
Amp ^^P-labeled DNA probes. In these studies. Amp hybridization served as a 
control to show that there was a sufficient quantity of DNA for the analysis. The 
percentages of Alkphos and/or GFP hybridizing plasmids were calculated by this 
method for each muscle sample. From this percentage, the total nxmiber of plasmids 
hybridizing to each probe in the Hirt DNA sample was calculated from the total 
CFU obtained in each transformation. In this analysis, each muscle sample was 
evaluated independently to determine the mean (+/-SEM) total Alkphos and/or GFP 
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hybridizing plasmids. A second evaluation involved the transfection of rescued 
plasmids into 293 cells using lipofectamine, followed by evaluation of GFP 
fluorescence and histochemical staining for Alkphos. To confirm that GFP and 
Alkphos co-expressing plasmids were indeed clonal and that both genes were 

5 encoded on the same plasmid, a selected group of five co-expressing plasmids were 
retransfcrmed into E.coli and colonies were re-isolated prior to repeating the 
transfection studies. In all cases, plasmids co-expressing the two reporter genes 
remained clonal tiirough this subsequent re-isolation. 
Structural analvsis of concatamer rAAV circula r intermediates. 

1 0 To further characterize the nature of isolated circular intermediates co- 

expressing both GFP and Alkphos transgenes, plasmid structure was mapped by 
Southern blotting and restriction enzyme analysis. The structural of five co- 
expressing circular intermediate plasmids were determined by digestion with Ahdl, 
Hindm, NotI, Hindm/Notl, Clal/Asel, and/or SnaBI and Southern blotting was 

1 5 performed with '^P-labeled GFP, Alkphos, and ITR probes. 
Results 

Strategy for characterizing mechanisms of rAAV circular intermediate formation. 

Efficient circularization of rAAV genomes has been previously demonstrated 
to occur in muscle in a time dependent fashion (Example 2). Furthermore, the 

20 conversion of monomeric to multimeric cu-cular rAAV intermediates occurred over 
time and was associated with long-term episomal persistence of AAV genomes. 
High molecular weight AAV circular genomes might form by either of the following 
two mechanisms, one involving the rephcation of monomer structures and the other 
through intermolecular recombmation between independent monomers. A rescue 

25 assay was developed using two separate rAAV vectors, AV.GFPSori and 

AV.Alkphos (Figure 14A), which allowed for the identification of independent viral 
genomes through unique transgenes. In this assay, circular form genomes were 
rescued in bacteria by virtue of Amp/ori sequences encoded in one of the two 
vectors (AV.GFPSori). A method for characterizing the extent of intermolecular 
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recombination between independent circular rAAV genomes was shown in Figure 
14B, 

Co-expression of independently encoded rAAV transgenes in muscle mvofibers . 

To confirm that myofibers can be co-infected at a high efficiency with the 
two rAAV vectors, the tibialis anterior muscle of mice was co-infected with 5x10^ 
DNA particles of both AV.GFPSori and AV.Alkphos. At 14, 35, 80, and 120 days 
post-infection, muscles were harvested and analyzed for transgene expression. 
Transgene expression from both reporters was weak but clearly visible in 14 day 
muscle samples. By 80 days post-infection, transgene expression was maximal and 
serial sections demonstrated expression of both Alkphos and GFP transgenes in 
overlapping regions of the muscle (Figures 15A-C). At this time point, 
approximately 50% of the fibers in the tibiaUs muscle expressed both transgenes. 
To confirm that co-infection of myofibers occurred with the two independent 
vectors, co-localization studies were performed on muscle sections by a serial 
staining procedure. These studies, depicted in Figure 15D, demonstrate four classes 
of myofiber transgene expression: 1) GFP positive only, 2) Alkphos positive only, 3) 
GFP/Alkphos positive, and 4) no transgene expression. The largest fraction of 
myofibers expressed both GFP and Alkphos transgenes. These results confirm that 
at the titers of virus used for infection, co-infection occurred in greater than 90% of 
transgene expressing myofibers. 

Rescue of bi-fimctional rAAV circular intermediates increases over time . 

To determine the extent of recombination between circular AAV genomes, 
circular form genomes were rescued as plasmids from low molecular weight Hirt 
DNA of muscle tissue co-infected with AV.GFP3ori and AV.Alkphos. Following 
transformation of Kcoli Sxire cells with Hirt DNA purified from infected muscles, 
the total number of GFP and Alkphos hybridizing Amp resistant bacterial plasmids 
was quantitated for each time point post-infection (Figure 16A and B) (Duan et al., 
1995), the abundance of circular AAV genomes rescued from AV.GFPSori 
increased over time. For each muscle sample (three for each time point) twenty 



78 



plasmid clones were evaluated for hybridization to GFP and Alkphos DNA probes 
and the total number of plasmids was back calculated from the total CFU for each 
individual muscle sample. Figure 16B demonstrates the mean (+/-SEM, N=3) total 
plasmids that hybridized to GFP or GFP/Alkphos probes at each time point. At 14 
days post-infection, GFP/Alkphos co-hybridizing plasmids were never observed, hi 
contrast, at time points after 35 days the percentage of GFP/Alkphos co-hybridizing 
plasmids increased with time and reached 33% by 120 days (Figure 16C). Since 
bacterial plasmid rescue can only occur through AV.GFP3ori genomes, this data 
suggests that recombination between independent Alkphos and GFP rAAV genomes 
takes place over tune. These resuhs are consistent with studies described 
hereinabove demonstrating a time dependent concatamerization of monomer 
circular rAAV genomes in muscle. 

To evaluate the ability of circular intermediates to express encoded 
transgenes, transient transfection studies were performed in 293 cells with rescued 
circular intermediate plasmids (Figures 17A-C). Between 85-90% of rescued 
plasmids hybridizing to GFP probes on slot blots also expressed the GFP transgene 
in this transfection assay (Figure 17D). The percentage of GFP expressing plasmids 
that also expressed Alkphos rose over time in concordance with the hybridization 
data (Figure 17D). However, approximately 40-50% of plasmids which were 
hybridization positive for Alkphos did not express the Alkphos transgene. This may 
represent recombinational deletion of the RSV promoter driving Alkphos expression 
which occurred during concatamerization at sites near the 5' ITR. These results 
demonstrate that intermolecular recombination between Alkphos and GFP derived 
circular intermediates occurs as part of the time dependent concatamerization 
process of rAAV in muscle. To confirm that ampUfied plasmids stocks expressing 
both reporter genes were actually clonal (i.e., one plasmid rather than two 
independent plasmids resulting from contamination), a select number of bacterial 
clones expressing both transgenes were re-isolated and the transfection assays were 
repeated. In all cases, plasmids expressing the two reporter genes remained clonal 
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through two rounds of bacterial cloning. Hence, dual reporter expression was not 
due to contamination of independent GFP and Alkphos expressing plasmids. 
Concatamerization of AAV circular intermediates occurs through unifoim 
intermolecular recombination between ITRs of independent viral genomes . 

To better understand the mechanisms of circular concatamer formation, a 
detailed structural analysis was performed of five bi-fimctional circular concatamers 
isolated Jfrom rAAV infected muscle samples. As previously described for the 
AV.GFPSori genome (Example 2), the conversion of monomeric circular AAV 
genomes to large multimeric circular concatamers with a predominant head-to-tail 
structure increased with time in muscle. To evaluate the structure of bi-functional 
circular concatamers, restriction enzyme mapping and Southem blot analysis using 
^^P-labeled EGFP, Alkphos, and ITR probes was employed. Results from five 
analyzed plasmids demonstrated between 3-6 genomes within these circular 
concatamers. Two representative structures from 35 and 80 day time points are 
shown in Figure 18. Several interesting conclusions can be made from this 
structural analysis. As described , head-to-tail oriented genomes could be seen in all 
isolated concatamers. However, several examples of head-to-head and tail-to-tail 
genome combinations of AV.Alkphos and AV.GFP3ori were also seen. Since head- 
to-head and tail-to-tail genome concatamers were never seen in muscles infected 
with AV.GFP3ori alone, there must be a selective disadvantage for bacterial 
rephcation when ori sequences are in either of these conformations. However, since 
the AV.Alkphos genomes do not contain a bacterial origin of replication, this 
orientation is permitted in chimeric concatamers. Second, noticeable deletions 
and/or loss of restriction sites close to ITRs were noted (Figure 17). It is not known 
whether deletions close to the ITR are a common event in the concatamerization 
process, but if so, this could account for the fact that only 60% of GFP/Alkphos 
hybridizing circular intermediates also expressed the Alkphos transgene. 
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Discussion 

Concatamerization of rAAV genomes has long been recognized in integrated 
proviral genomes. Recently, the association of this concatamerization process with 
the formation of high molecular circular genomes in muscle has suggested that this 
process may also be important in episomal persistence. The findings described 
herein demonstrated rescue of independent viral genomes within the same circular 
concatamer, suggesting that this process of concatamerization occurs through 
intermolecular recombination. Furthermore, at 14 days the predominant form of 
viral genome in muscle was circular monomers (Example 2), which correlates with 
the results described above demonstrating only GFP expression in rescued circular 
intermediates at this tune point. Together with the fact that bi-functional rescued 
circular concatamers increase with time, these results suggest that large concatamers 
form by recombination of monomeric circular precursor genomes. Furthermore, 
since an altemative model of concatamerization by rolling circular repUcation would 
be expected to yield only GFP expressing rescued plasmids in this system, this 
mechanism does not appear responsible for concatamerization. 

Based on the structural analysis of these bi-functional circular intermediates, 
recombination between monomeric circular rAAV genomes is likely facihtated 
through ITR sequences. Directionality of this recombinational event does not 
appear to play a significant role, since head-to-tail, head-to-head, and tail-to-tail 
oriented intermolecular concatamers were found. In addition, the extent to which 
recombination within ITR repeat regions occurs in bacteria is presently unknown 
and may account for the deletions and/or restriction site losses near ITR arrays. 
However, serial passaging of bi-fimctional circular AAV genomes in bacteria has 
suggested that the structure of these large concatamers is impressively stable in 
bacteria. 

Intermolecular recombination of rAAV genomes to form single circular 
episomes may be particularly usefiil for gene therapy. For example, large regulatory 
elements and genes beyond the packaging capacity of rAAV may become linked 
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after co-infecting tissue with two independent vectors (Figure 19). This strategy 
could also involve trans-splicing vectors encoding two independent regions of a 
gene which are brought together to form an intact splicing unit by circular 
concatamerization. 

For example, two independent vectors encoding two halves of the CFTR 
gene flanked by donor and acceptor splice site sequences are prepared. Expression 
of functional CFTR protein results after spUcing of RNA transcribed from a 
concatamerized genome comprising both halves of the gene in the sense orientation. 
One rAAV vector may comprise the furst 3.3 kb of the CFTR gene under the control 
of the RSV promoter and an in-frame splice donor site at the 3' end of the CFTR 
cDNA. The second rAAV vector encodes a splice acceptor intronic sequence, the 
3' 1.4 kb of the CFTR gene, and SV40 poly-adenylation sequences. To test for 
efficient splicing, a chimeric vector (pcDNA3.1CFTR-Donor/Acceptor) is 
introduced to Xenopus oocytes by nuclear injection of the vector, followed by two 
electrode voltage (TEV) clamp recording fimctional analysis of CFTR (Jiang et al., 
1998). mRNA transcripts are also analyzed for correct sphcing following 
transfection of pcDNA3.1CFTR-Donor/Acceptor into MDCK cells. Polarized 
airway epithelial cells grown at the air-Uquid interface are co-infected with the 
donor and acceptor CFTR AAV vectors CFTR gene expression in these cells is then 
monitored by both immunofluorescent localization and functional analysis of short 
circuit currents (Smith et al, 1992; Smith et al., 1990). Hirt analyses of episomal 
AAV species are used to correlate the efficacy and persistence of CFTR gene 
expression with the formation of AAV circular intermediates. 
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